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Preface 



This book is a free text intended to be the basis for a problem- 
oriented course (s) in mathematical logic and computability for students 
with some degree of mathematical sophistication. Parts I and II cover 
the basics of prepositional and first-order logic respectively, Part III 
covers the basics of computability using Turing machines and recursive 
functions, and Part IV covers Godel's Incompleteness Theorems. They 
can be used in various ways for courses of various lengths and mixes of 
material. The author typically uses Parts I and II for a one-term course 
on mathematical logic. Part III for a one-term course on computability, 
and/or much of Part III together with Part IV for a one-term course 
on computability and incompleteness. 

In keeping with the modified Moore-method, this book supplies 
definitions, problems, and statements of results, along with some ex- 
planations, examples, and hints. The intent is for the students, indi- 
vidually or in groups, to learn the material by solving the problems 
and proving the results for themselves. Besides constructive criticism, 
it will probably be necessary for the instructor to supply further hints 
or direct the students to other sources from time to time. Just how 
this text is used will, of course, depend on the instructor and students 
in question. However, it is probably not appropriate for a conventional 
lecture-based course nor for a really large class. 

The material presented in this text is somewhat stripped-down. 
Various concepts and topics that are often covered in introductory 
mathematical logic and computability courses are given very short 
shrift or omitted entirely.^ Instructors might consider having students 
do projects on additional material if they wish to to cover it. 

Prerequisites. The material in this text is largely self-contained, 
though some knowledge of (very basic) set theory and elementary num- 
ber theory is assumed at several points. A few problems and examples 
draw on concepts from other parts of mathematics; students who are 

"'^Future versions of both volumes may include more - or less! - material. Feel 
free to send suggestions, corrections, criticisms, and the like — I'll feel free to ignore 
them or use them. 
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not already familiar with these should consult texts in the appropri- 
ate subjects for the necessary definitions. What is really needed to 
get anywhere with all of the material developed here is competence in 
handling abstraction and proofs, including proofs by induction. The 
experience provided by a rigorous introductory course in abstract al- 
gebra, analysis, or discrete mathematics ought to be sufficient. 



Chapter Dependencies. The following diagram indicates how 
the parts and chapters depend on one another, with the exception 
of a few isolated problems or subsections. 
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Introduction 



What sets mathematics aside from other disciphnes is its rehance on 
proof as the principal technique for determining truth, where science, 
for example, relies on (carefully analyzed) experience. So what is a 
proof? Practically speaking, a proof is any reasoned argument accepted 
as such by other mathematicians.^ A more precise definition is needed, 
however, if one wishes to discover what mathematical reasoning can 
- or cannot - accomplish in principle. This is one of the reasons for 
studying mathematical logic, which is also pursued for its own sake 
and in order to find new tools to use in the rest of mathematics and in 
related fields. 

In any case, mathematical logic is concerned with formalizing and 
analyzing the kinds of reasoning used in the rest of mathematics. The 
point of mathematical logic is not to try to do mathematics per se 
completely formally — the practical problems involved in doing so are 
usually such as to make this an exercise in frustration — but to study 
formal logical systems as mathematical objects in their own right in 
order to (informally!) prove things about them. For this reason, the 
formal systems developed in this part and the next arc optimized to 
be easy to prove things about, rather than to be easy to use. Natural 
deductive systems such as those developed by philosophers to formalize 
logical reasoning are equally capable in principle and much easier to 
actually use, but harder to prove things about. 

Part of the problem with formalizing mathematical reasoning is the 
necessity of precisely specifying the language (s) in which it is to be 
done. The natural languages spoken by humans won't do: they are 
so complex and continually changing as to be impossible to pin down 
completely. By contrast, the languages which underly formal logical 
systems arc, like programming languages, rigidly defined but much sim- 
pler and less fiexible than natural languages. A formal logical system 
also requires the careful specification of the allowable rules of reasoning. 



^If you are not a mathematician, gentle reader, you are hereby temporarily 



promoted. 
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plus some notion of how to interpret statements in the underlying lan- 
guage and determine their truth. The real fun lies in the relationship 
between interpretation of statements, truth, and reasoning. 

The de facto standard for formalizing mathematical systems is first- 
order logic, and the main thrust of this text is studying it with a 
view to understanding some of its basic features and limitations. More 
specifically. Part I of this text is concerned with propositional logic, 
developed here as a warm-up for the development of first-order logic 
proper in Part II. 

Propositional logic attempts to make precise the relationships that 
certain connectives like not, and, or, and if . . . then are used to ex- 
press in English. While it has uses, propositional logic is not powerful 
enough to formalize most mathematical discourse. For one thing, it 
cannot handle the concepts expressed by the quantifiers all and there 
is. First-order logic adds these notions to those propositional logic 
handles, and suffices, in principle, to formalize most mathematical rea- 
soning. The greater flexibility and power of first-order logic makes it a 
good deal more complicated to work with, both in syntax and seman- 
tics. However, a number of results about propositional logic carry over 
to first-order logic with little change. 

Given that first-order logic can be used to formalize most mathe- 
matical reasoning it provides a natural context in which to ask whether 
such reasoning can be automated. This question is the Entschei- 
dungsproblem^: 

Entscheidungsproblem. Given a set S of hypotheses and some 
statement (p, is there an effective method for determining whether or 
not the hypotheses in E suffice to prove (pi 

Historically, this qTicstion arose out of David Hilbcrt's scheme to 
secure the foundations of mathematics by axiomatizing mathematics 
in first-order logic, showing that the axioms in question do not give 
rise to any contradictions, and that they suffice to prove or disprove 
every statement (which is where the Entscheidungsproblem comes in). 
If the answer to the Entscheidungsproblem were "yes" in general, the 
effective method(s) in question might put mathematicians out of busi- 
ness. . . Of course, the statement of the problem begs the question of 
what "effective method" is supposed to mean. 

In the course of trying to find a suitable formalization of the no- 
tion of "effective method" , mathematicians developed several different 



'Entscheidungsproblem = decision problem. 
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abstract models of computation in the 1930's, including recursive func- 
tions, A-calculus, Turing machines, and grammars^. Although these 
models are very different from each other in spirit and formal defini- 
tion, it turned out that they were all essentially equivalent in what they 
could do. This suggested the (empirical, not mathematical!) principle: 

Church's Thesis. A function is effectively computable in princi- 
ple in the real world if and only if it is computable by (any) one of the 
abstract models mentioned above. 

Part III explores two of the standard formalizations of the notion of 
"effective method", namely Turing machines and recursive functions, 
showing, among other things, that these two formalizations are actually 
equivalent. Part IV then uses the tools developed in Parts II ands III 
to answer the Entschcidungsproblem for first-order logic. The answer 
to the general problem is negative, by the way, though decision proce- 
dures do exist for propositional logic, and for some particular first-order 
languages and sets of hypotheses in these languages. 

Prerequisites. In principle, not much is needed by way of prior 
mathematical knowledge to define and prove the basic facts about 
propositional logic and computability. Some knowledge of the natu- 
ral numbers and a little set theory suffices; the former will be assumed 
and the latter is very briefiy summarized in Appendix A. ([10] is a 
good introduction to basic set theory in a style not unlike this book's; 
[8] is a good one in a more conventional mode.) Competence in han- 
dling abstraction and proofs, especially proofs by induction, will be 
needed, however. In principle, the experience provided by a rigorous 
introductory course in algebra, analysis, or discrete mathematics ought 
to be sufficient. 

Other Sources and Further Reading. [2], [5], [7], [12], and [13] 
are texts which go over large parts of the material covered here (and 
often much more besides), while [1] and [4] are good references for more 
advanced material. A number of the key papers in the development of 
modern mathematical logic and related topics can be found in [9] and 
[6]. Entertaining accounts of some related topics may be found in [11], 



The development of the theory of computation thus actually began before the 
development of electronic digital computers. In fact, the computers and program- 
ming languages we use today owe much to the abstract models of computation 
which preceded them. For example, the standard von Neumann architecture for 
digital computers was inspired by Turing machines and the programming language 
LISP borrows much of its structure from A-calculus. 
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[14] and[15]. Those interested in natural deductive systems might try 
[3] , which has a very clean presentation. 



Part I 
Prepositional Logic 



CHAPTER 1 



Language 

Propositional logic (sometimes called sentential or predicate logic) 
attempts to formalize the reasoning that can be done with connectives 
like not, and, or, and if . . . then. We will define the formal language 
of propositional logic, £p, by specifying its symbols and rules for as- 
sembling these symbols into the formulas of the language. 

Definition 1.1. The symbols of Cp are: 

(1) Parentheses: ( and ). 

(2) Connectives: -i and — 

(3) Atomic formulas: Aq, A\, A2, . . . , An, ■ ■ ■ 

We still need to specify the ways in which the symbols of Cp can 
be put together. 

Definition 1.2. The formulas of £p are those finite sequences or 
strings of the symbols given in Definition 1.1 which satisfy the following 
rules: 

(1) Every atomic formula is a formula. 

(2) If a is a formula, then {-'a) is a formula. 

(3) If a and /3 are formulas, then (a — > /5) is a formula. 

(4) No other sequence of symbols is a formula. 

We will often use lower-case Greek characters to represent formulas, 
as we did in the definition above, and upper-case Greek characters 
to represent sets of formulas.^ All formulas in Chapters 1-4 will be 
assumed to be formulas of Cp unless stated otherwise. 

What do these definitions mean? The parentheses are just punc- 
tuation: their only purpose is to group other symbols together. (One 
could get by without them; see Problem 1.6.) -1 and — > are supposed to 
represent the connectives not and if . . . then respectively. The atomic 
formulas, Aq, A\, . . . , are meant to represent statements that cannot 
be broken down any further using our connectives, such as "The moon 
is made of cheese." Thus, one might translate the the English sen- 
tence "If the moon is red, it is not made of cheese" into the formula 

^The Greek alphabet is given in Appendix B. 

3 



4 



1. LANGUAGE 



{Aq of Cp by using Aq to represent "The moon is red" and 

Ai to represent "The moon is made of cheese." Note that the truth 
of the formula depends on the interpretation of the atomic sentences 
which appear in it. Using the interpretations just given of Aq and Ai, 
the formula {Aq — >• {-'Ai)) is true, but if we instead use Aq and Ai 
to interpret "My telephone is ringing" and "Someone is calling me", 
respectively, {Aq — > {-^Ai)) is false. 

Definition 1.2 says that that every atomic formula is a formula and 
every other formula is built from shorter formulas using the connectives 
and parentheses in particular ways. For example, ^1123, (^2 (~'^o)), 
and (((-i^i) {Ai A7)) A7) are all formulas, but X^, {A5), 
()-iA4i, A^ — > Ay, and {A2 — > {~'Ao) are not. 

Problem 1.1. Why are the following not formulas of Cp? There 
might he more than one reason. . . 

(1) A-56 

(2) {Y A) 

(3) (A ^ A,) 

(4) A (-A5)) 

(5) (^8^9 ^1043998 

(6) (((-Ai) ^ {A, ^ A7) ^ A,) 

Problem 1.2. Show that every formula of Cp has the same number 
of left parentheses as it has of right parentheses. 

Problem 1.3. Suppose a is any formula of Cp. Let i{a) be the 
length of a as a sequence of symbols and let p{a) be the number of 
parentheses (counting both left and right parentheses) in a. What are 
the minimum and maximum values ofp{a)/i{a)? 

Problem 1.4. Suppose a is any formula of Cp. Let s{a) be the 
number of atomic formulas in a (counting repetitions) and let c{a) be 
the number of occurrences of ^ in a. Show that s{a) — c{a) + 1. 

Problem 1.5. What are the possible lengths of formulas of Cp? 
Prove it. 

Problem 1.6. Find a way for doing without parentheses or other 
punctuation symbols in defining a formal language for propositional 
logic. 

Proposition 1.7. Show that the set of formulas of Cp is countable. 

Informal Conventions. At first glance, Cp may not seem capable 
of breaking down English sentences with connectives other than not 
and if . . . then. However, the sense of many other connectives can be 
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captured by these two by using suitable circumlocutions. We will use 
the symbols A, V, and <-> to represent and, or,^ and if and only if 
respectively. Since they are not among the symbols of £p, we will use 
them as abbreviations for certain constructions involving only -i and 
— Namely, 

• {a A (3) is short for {-'{a — »■ 

• {ay P) is short for ((-la) (3), and 

• (a /9) is short for ((a (3) /\{(3 ^ a)). 

Interpreting Aq and Ai as before, for example, one could translate the 
English sentence "The moon is red and made of cheese" as {Aq A Ai). 
(Of course this is really {~'{Aq — >• (-lAi))), i.e. "It is not the case that 
if the moon is green, it is not made of cheese.") A, V, and <-> were not 
included among the official symbols of £p partly because wc can get 
by without them and partly because leaving them out makes it easier 
to prove things about Cp. 

Problem 1.8. Take a couple of English sentences with several con- 
nectives and translate them into formulas of Cp. You may use A, V, 
and <-> if appropriate. 

Problem 1.9. Write out {{a W (3) A {(3 ^ a)) using only -i and 

For the sake of readability, we will occasionally use some informal 
conventions that let us get away with writing fewer parentheses: 

• We will usually drop the outermost parentheses in a formula, 
writing a ^ (3 instead of (a — > (3) and -la instead of {-^a) . 

• We will let -i take precedence over when parentheses are 
missing, so (3 is short for {{-'a) — >• /?), and fit the 
informal connectives into this scheme by letting the order of 
precedence be -i. A, V, — >, and 

• Finally, we will group repetitions of V, A, or ^ to the 
right when parentheses are missing, so a — >• /3 — >• 7 is short for 
(a^(/5^7))- 

Just like formulas using V, A, or -1, formulas in which parentheses have 
been omitted as above are not official formulas of >Cp, they are conve- 
nient abbreviations for official formulas of £p. Note that a precedent 
for the precedence convention can be found in the way that • commonly 
takes precedence over + in writing arithmetic formulas. 

Problem 1.10. Write out -'{a ^ -^S) A/? ^ -lo; — > 7 first with the 
missing parentheses included and then as an official formula of Cp. 

^We will use or inclusively, so that ">1 or B" is still true if both of A and B 
are true. 
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The following notion will be needed later on. 

Definition 1.3. Suppose <^ is a formula of Cp. The set of subfor- 
mulas of ip, S{(p), is defined as follows. 

(1) If if is an atomic formula, then S{(p) = {<f}- 

(2) If V? is i^a), then S{<^) = S{a) U {(^a)}. 

(3) If If is {a ^ p), then S{ip) = S{a) U S{p) U {{a (3)}. 

For example, if ip is (((-lAi) — >• A^) — >• (Ag — > ^i)), then iS(<^) 
includes Ai, A7, As, {^Ai), (Ag ^ Ai), ((^^1) ^ A7), and (((-^1) ^ 
A7) ^ (As ^ Ai)) itself. 

Note that if you write out a formula with all the official parenthe- 
ses, then the subformulas are just the parts of the formula enclosed by 
matching parentheses, plus the atomic formulas. In particular, every 
formula is a subformula of itself. Note that some subformulas of for- 
mulas involving our informal abbreviations V, A, or ^ will be most 
conveniently written using these abbreviations. For example, if ip is 
A4^ AiV A4, then 

S{ilj) = { Ai, ^4, (-Ai), {A, V A4), (A4 ^ {A, V A4)) } . 

(As an exercise, where did ("i^i) come from?) 

Problem 1.11. Find all the subformulas of each of the following 
formulas. 

(1) (-((-^56) ^ A,,)) 

(2) Aq-^As^ ^{Ars --Ao) 

(3) -^Ao A -^Ai ^ -n{Ao V Ai) 

Unique Readability. The slightly paranoid — er, truly rigorous 
— might ask whether Definitions 1.1 and 1.2 actually ensure that the 
formulas of >Cp are unambiguous, i.e. can be read in only one way 

according to the rules given in Definition 1.2. To actually prove this 
one must add to Definition 1.1 the requirement that all the symbols 
of £p are distinct and that no symbol is a subsequence of any other 
symbol. With this addition, one can prove the following: 

Theorem 1.12 (Unique Readability Theorem). A formula of Cp 
must satisfy exactly one of conditions 1-3 in Definition 1.2. 
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Whether a given formula (f of Cp is true or false usually depends on 
how we interpret the atomic formulas which appear in 93. For example, 
if is the atomic formula A2 and we interpret it as "2+2 = 4" , it is true, 
but if we interpret it as "The moon is made of cheese" , it is false. Since 
we don't want to commit ourselves to a single interpretation — after 
all, we're really interested in general logical relationships — we will 
define how any assignment of truth values T ( "true" ) and F ( "false" ) 
to atomic formulas ol Cp can be extended to all other formulas. We 
will also get a reasonable definition of what it means for a formula of 
Cp io follow logically from other formulas. 

Definition 2.1. A truth assignment is a function v whose domain 
is the set of all formulas oi Cp and whose range is the set {T, F} of 
truth values, such that: 

(1) v{An) is defined for every atomic formula A^. 

(2) For any formula a. 



Given interpretations of all the atomic formulas of £p, the corre- 
sponding trTith assignment would give each atomic formula representing 
a true statement the value T and every atomic formula representing a 
false statement the value F. Note that we have not defined how to 
handle any truth values besides T and F in Cp. Logics with other 
truth values have uses, but are not relevant in most of mathematics. 

For an example of how non-atomic formulas are given truth values 
on the basis of the truth values given to their components, suppose 
w is a truth assignment such that v{Aq) = T and '^(^i) = F. Then 
v{ {{-'Ai) — > (^0 — ^1)) ) is determined from v{ {-'Ai) ) and v{ {Aq — > 





T otherwise. 



F if v{a) = T and v{P) = F 
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Ai) ) according to clause 3 of Definition 2.1. In turn, v{ ("i^i) ) is deter- 
mined from of v{Ai) according to clause 2 and v{ (Aq — > Ai) ) is deter- 
mined from v{Ai) and v{Ao) according to clause 3. Finally, by clause 1, 
our truth assignment must be defined for all atomic formulas to begin 
with; in this case, v{Ao) = T and v{Ai) = F. Thus v{ ) = T and 

vi (Ao ^A-,) )= F, so v{ ((-Ai) ^ {Ao ^ ^i)) ) = F. 

A convenient way to write out the determination of the truth value 
of a formula on a given truth assignment is to use a truth table: list all 
the subformulas of the given formula across the top in order of length 
and then fill in their truth values on the bottom from left to right. 
Except for the atomic formulas at the extreme left, the truth value of 
each subformula will depend on the truth values of the subformulas to 
its left. For the example above, one gets something like: 



Ao 


Ai 


Ml) 


{Ao ^ A,) 


Ml) ^ (^0 ^ ^l)) 


T 


F 


T 


F 


F 



Problem 2.1. Suppose v is a truth assignment such that v{Ao) — 
v{A2) = T and v{Ai) = f (/I3) = F. Find v{a) if a is: 

(1) -nA2 -A3 

(2) 'nA2 ^ As 

(3) -^{^Ao ^ Ai) 

(4) Ao V Ai 

(5) Ao A Ai 

The use of finite truth tables to determine what truth value a par- 
ticular truth assignment gives a particular formula is justified by the 
following proposition, which asserts that only the truth values of the 
atomic sentences in the formula matter. 

Proposition 2.2. Suppose S is any formula and u and v are truth 
assignments such that u{An) = v{An) for all atomic formulas An which 
occur in 6. Then u{6) = v{6). 

Corollary 2.3. Suppose u and v are truth assignments such that 
u{An) = v{An) for every atomic formula An- Then u = v, i.e. u{ip) = 
v{(fi) for every formula (p. 

Proposition 2.4. If a and (5 are formulas andv is a truth assign- 
ment, then: 

(1) v{-^q) =T if and only if v{a) = F. 

{2) v{a j3) = T if and only if v{j3) = T whenever v{a) — T ; 

(3) v{a A (3) = T if and only if v{a) = T and v{f3) = T; 

(4) v{a y P) = T if and only if v{a) = T or v{(3) = T; and 

(5) v{a (5) — T if and only if v{a) — v{(3). 
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Truth tables arc often used even when the formula in question is 
not broken down all the way into atomic formulas. For example, if a 
and (3 arc any formulas and wc know that a is true but j3 is false, then 
the truth of {a ("'/?)) can be determined by means of the following 
table: 



a 






{a ^ i^f3)) 


T 


F 


T 


T 



Definition 2.2. If f is a truth assignment and <^ is a formula, we 
will often say that v satisfies ip if v{ip) = T. Similarly, if E is a set 
of formulas, we will often say that v satisfies S if v{a) = T for every 
cr e E. We will say that ip (respectively, S) is satisfiable if there is 
some truth assignment which satisfies it. 

Definition 2.3. A formula </? is a tautology if it is satisfied by every 
truth assignment. A formula is a contradiction if there is no truth 
assignment which satisfies it. 

For example, {A4 — ^ A4) is a tautology while ("1(^4 — > ^4)) is a 
contradiction, and A4 is a formula which is neither. One can check 
whether a given formula is a tautology, contradiction, or neither, by 
grinding out a complete truth table for it, with a separate line for each 
possible assignment of truth values to the atomic subformulas of the 
formula. For A3 {A4 A3) this gives 



A3 


A, 


A^-^As 


As (A4 


T 


T 


T 


T 


T 


F 


T 


T 


F 


T 


F 


T 


F 


F 


T 


T 



so ^3 — > (A4 A3) is a tautology. Note that, by Proposition 2.2, we 
need only consider the possible truth values of the atomic sentences 
which actually occur in a given formula. 

One can often use truth tables to determine whether a given formula 
is a tautology or a contradiction even when it is not broken down all 
the way into atomic formulas. For example, if a is any formula, then 
the table 



a 


{a a) 


{^{a^a)) 


T 


T 


F 


F 


T 


F 



demonstrates that — > a)) is a contradiction, no matter which 
formula of Cp a actually is. 

Proposition 2.5. If a is any formula, then ((-iq;) V a) is a tau- 
tology and ((-la) A a) is a contradiction. 
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Proposition 2.6. A formula P is a tautology if and only if -^(3 is 
a contradiction. 

After all this warmup, we are finally in a position to define what it 
means for one formula to follow logically from other formulas. 

Definition 2.4. A set of formulas E implies a formula 93, written 
as S 1= ip, if every truth assignment v which satisfies S also satisfies </?. 
We will often write S y9 if it is not the case that S |= ip. In the case 
where E is empty, we will usually write |= ip instead of |= ip. 

Similarly, if A and F are sets of formulas, then A implies F, written 
as A 1= F, if every truth assignment v which satisfies A also satisfies 
F. 

For example, {A3, (A3 ^ ^Ay) } ^ ^Ay, but { Ag, (A5 ^ Ag) } 

A5. (There is a truth assignment which makes A^ and A5 — > A^ true, 
but false.) Note that a formula p is a, tautology if and only if |= p, 
and a contradiction if and only if |= (""/?)• 

Proposition 2.7. IfV and E are sets of formulas such that F C E, 
then E h r. 

Problem 2.8. How can one check whether or not E |= for a 
formula ip and a finite set of formulas E? 

Proposition 2.9. Suppose T, is a set of formulas and ip and p are 
formulas. Then E U {-0} \^ p if and only if \^ ip ^ p. 

Proposition 2.10. A set of formulas E is satisfiable if and only if 
there is no contradiction x such that E ^ 



CHAPTER 3 



Deductions 

In this chapter we develop a way of defining logical implication 
that does not rely on any notion of truth, but only on manipulating 
sequences of formulas, namely formal proofs or deductions. (Of course, 
any way of defining logical implication had better be compatible with 
that given in Chapter 2.) To define these, we first specify a suitable 
set of formulas which we can use freely as premisses in deductions. 

Definition 3.1. The three axiom schema oi Cp are: 
Al: {a^ {p^ a)) 

A2: {{a ^{(3^ 7)) - ((« ^ f3) ^ {a ^ 7))) 
A3: a) 

Replacing a, (3, and 7 by particular formulas of >Cp in any one of the 
schemas Al, A2, or A3 gives an axiom oi Cp. 

For example, (^41 — > {A4 Ai)) is an axiom, being an instance of 

axiom schema Al, but {Ag {-iAq)) is not an axiom as it is not the 
instance of any of the schema. As had better be the case, every axiom 
is always true: 

Proposition 3.1. Every axiom of Cp is a tautology. 

Second, we specify our one (and only!) rule of inference.^ 

Definition 3.2 (Modus Ponens). Given the formulas ip and (ip — > 
ijj), one may infer -0. 

We will usually refer to Modus Ponens by its initials, MP. Like any 
rule of inference worth its salt, MP preserves truth. 

Proposition 3.2. Suppose (p and ip are formulas. Then {(f, (</?—*■ 

With axioms and a rule of inference in hand, we can execute formal 
proofs in >Cp. 

"'^Natural deductive systems, which are usually more convenient to actually 
execute deductions in than the system being developed here, compensate for having 
few or no axioms by having many rules of inference. 

11 
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Definition 3.3. Let S be a set of formulas. A deduction or proof 
from E in £p is a finite sequence (fii(fi2 ■ ■ ■ fn of formulas such that for 
each k < n, 

(1) is an axiom, or 

(2) Lpk e S, or 

(3) there are i.j < k such that ipk follows from ipi and (pj by MP. 

A formula of E appearing in the deduction is called a premiss. E proves 
a formula a, written as E h a, if a is the last formula of a deduction 
from E. We'll usually write h a for h a, and take E h A to mean 
that E h 5 for every formula 5 e A. 

In order to make it easier to verify that an alleged deduction really 
is one, we will number the formulas in a deduction, write them out in 
order on separate lines, and give a justification for each formula. Like 
the additional connectives and conventions for dropping parentheses in 
Chapter 1, this is not officially a part of the definition of a deduction. 



Example 3.1. Let us show that !-(/?—>(/?. 

(1) ^ {{^ ^ ^) ^ ^)) ^ {{cp ^{cp^ cp)) ^ ^ v)) A2 

(2) ^^{(^^^)^ ^) Al 

(3) ^ ^ ip)) ^{^^^) 1,2 MP 

(4) <^ ^ ^ Al 

(5) (p^(p 3,4 MP 

Hence h </? — > (/?, as desired. Note that indication of the formulas from 
which formulas 3 and 5 beside the mentions of MP. 

Example 3.2. Let us show that {a ^ P, P ^ ^} \- a ^ ^. 

(1) (/? ^ ^) ^ (a ^ (/3 ^ 7)) Al 

(2) /9 — 7 Premiss 

(3) a (/? ^ 7) 1,2 MP 

(4) ia^if3^ 7)) - ((« ^f3)^{a^ 7)) A2 

(5) {a-^ p)^{a^ 7) 4,3 MP 

(6) a ^ f3 Premiss 

(7) a ^ 7 5,6 MP 

Hence /3^7}I-q;— >-7, as desired. 



It is frequently convenient to save time and effort by simply referring 
to a deduction one has already done instead of writing it again as part 
of another deduction. If you do so, please make sure you appeal only 
to deductions that have already been carried out. 

Example 3.3. Let us show that h (-la a) ^ a. 
(1) (-iCK — > -la) {{~'0i ^ a) — > a) A3 
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(2) -iQ; — >• -la Example 3.1 

(3) {^a ^a)^a 1,2 MP 

Hence h (-la — > a) — > a, as desired. To be completely formal, one 

would have to insert the deduction given in Example 3.1 (with Lp re- 
placed by -la throughout) in place of line 2 above and renumber the 
old line 3. 

Problem 3.3. Show that if a, jS, and'-) are formulas, then 

(1) {a^(/?^7),/5}^«^7 

(2) h a V -.a 

Example 3.4. Let us show that I — ^^jS (3. 

(1) ^ ^ ((-/3 ^ ^ /?) A3 

(2) {-^(3 ^^(3) Al 

(3) ^ ^ ^ /3) 1,2 Example 3.2 

(4) -^(3 -^(3 Example 3.1 

(5) -^^p (3 3,4 Problem 3.3.1 
Hence I — j3, as desired. 

Certain general facts are sometimes handy: 

Proposition 3.4. Ifipiip2 . . .(pn is a deduction ofCp, then (pi . . .(p£ 

is also a deduction of Cp for any £ such that 1 < i < n. 

Proposition 3.5. Ifr\-Sandr\-S^p, then F h 

Proposition 3.6. //P C A and r\- a, then Aha. 

Proposition 3.7. //P I- A and A\- a, then P h cr. 

The following theorem often lets one take substantial shortcuts 
when trying to show that certain deductions exist in >Cp, even though 
it doesn't give us the deductions explicitly. 

Theorem 3.8 (Deduction Theorem). // E is any set of formulas 
and a and (3 are any formulas, then h a ^ (3 if and only if'EL){a} h 
(3. 

Example 3.5. Let us show that \- ip ^ ip. By the Deduction 
Theorem it is enough to show that {(p} h (p, which is trivial: 

(1) (p Premiss 
Compare this to the deduction in Example 3.1. 

Problem 3.9. Appealing to previous deductions and the Deduction 
Theorem if you wish, show that: 

(1) {5, ^5} h 7 
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(2) \- ip ^ -\-\(p 

(3) h l^P ^a) ^{a^ (3) 

(4) h (a ^ /?) ^ ^ 

(5) h (/3 ^ ^a) ^ (« ^ 

(6) h ^ a) ^ (^a ^ /3) 

(7) h a ^ (a Vr) 

(8) {a A/3} h/3 

(9) {a A /?} h a 



CHAPTER 4 



Soundness and Completeness 

How are deduction and implication related, given that they were 
defined in completely different ways? We have some evidence that they 
behave alike; compare, for example. Proposition 2.9 and the Deduction 
Theorem. It had better be the case that if there is a deduction of a 
formula ip from a set of premisses S, then ip is implied by S. (Otherwise, 
what's the point of defining deductions?) It would also be nice for the 
converse to hold: whenever (/? is implied by E, there is a deduction of 
(/? from E. (So anything which is true can be proved.) The Soundness 
and Completeness Theorems say that both ways do hold, so E h 99 if 
and only if E |= </5, i.e. h and |= are equivalent for propositional logic. 
One direction is relatively straightforward to prove. . . 

Theorem 4.1 (Soundness Theorem). If ^ is a set of formulas and 
a is a formula such that Aha, then A |= a. 

. . . but for the other direction we need some additional concepts. 

Definition 4.1. A set of formulas F is inconsistent if F I — 1(0; —> 
a) for some formula a, and consistent if it is not inconsistent. 

For example, {^41} is consistent by Proposition 4.2, but it follows 
from Problem 3.9 that {A13, "lAia} is inconsistent. 

Proposition 4.2. // a set of formulas is satisfiable, then it is con- 
sistent. 

Proposition 4.3. Suppose A is an inconsistent set of formulas. 
Then A\- ip for any formula ip. 

Proposition 4.4. Suppose E is an inconsistent set of formulas. 
Then there is a finite subset A 0/ E such that A is inconsistent. 

Corollary 4.5. A set of formulas F is consistent if and only if 
every finite subset of F is consistent. 

To obtain the Completeness Theorem requires one more definition. 

Definition 4.2. A set of formulas E is maximally consistent if E 
is consistent but E U {</?} is inconsistent for any (/? ^ E. 

15 
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That is, a set of formulas is maximally consistent if it is consistent, 
but there is no way to add any other formula to it and keep it consistent. 

Problem 4.6. Suppose v is a truth assignment. Show that S = 
{(f I v{(fi) —T} is maximally consistent. 

We will need some facts concerning maximally consistent theories. 

Proposition 4.7. //S is a maximally consistent set of formulas, 
(p is a formula, and She/?, then (/? e E. 

Proposition 4.8. Suppose E is a maximally consistent set of for- 
mulas and (p is a formula. Then -up if and only if ^ E. 

Proposition 4.9. Suppose is a maximally consistent set of for- 
mulas and Lp and ip are formulas. Then (p ^ ip & if and only if 
(f ^ 12 or ip & 12. 

It is important to know that any consistent set of formulas can be 
expanded to a maximally consistent set. 

Theorem 4.10. Suppose T is a consistent set of formulas. Then 
there is a maximally consistent set of formulas E such that F C E. 

Now for the main event! 

Theorem 4.11. A set of formulas is consistent if and only if it is 
satisfiable. 

Theorem 4.11 gives the equivalence between h and |= in slightly 
disguised form. 

Theorem 4.12 (Completeness Theorem). If A is a set of formulas 
and a is a formula such that A \= a, then Aha. 

It follows that anything provable from a given set of premisses must 
be true if the premisses are, and vice versa. The fact that h and |= are 
actually equivalent can be very convenient in situations where one is 
easier to use than the other. For example, most parts of Problems 3.3 
and 3.9 are much easier to do with truth tables instead of deductions, 
even if one makes use of the Deduction Theorem. 

Finally, one more consequence of Theorem 4.11. 

Theorem 4.13 (Compactness Theorem). A set of formulas F is 
satisfiable if and only if every finite subset of F is satisfiable. 

We will not look at any uses of the Compactness Theorem now, 
but we will consider a few applications of its counterpart for first-order 
logic in Chapter 9. 



Hints for Chapters 1—4 



Hints for Chapter 1. 

1.1. Symbols not in the language, unbalanced parentheses, lack of 
connectives. . . 

1.2. The key idea is to exploit the recursive structure of Defini- 
tion 1.2 and proceed by induction on the length of the formula or on 
the number of connectives in the formula. As this is an idea that will 
be needed repeatedly in Parts 1, 11, and IV, here is a skeleton of the 
argument in this case: 

Proof. By induction on n, the number of connectives {i.e. occur- 
rences of -1 and/or ^) in a formula (p of Cp, we will show that any 
formula if must have just as many left parentheses as right parentheses. 

Base step: {n = 0) If is a formula with no connectives, then it 
must be atomic. (Why?) Since an atomic formula has no parentheses 
at all, it has just as many left parentheses as right parentheses. 

Induction hypothesis: {n < k) Assume that any formula with n < k 
connectives has just as many left parentheses as right parentheses. 

Induction step: {n = k + 1) Suppose is a formula with n = k + 1 
connectives. It follows from Definition 1.2 that ip must be either 

(1) {~<a) for some formula a with k connectives or 

(2) {P — > 7) for some formulas /3 and 7 which have < k connectives 
each. 

(Why?) We handle the two cases separately: 

(1) By the induction hypothesis, a has just as many left paren- 
theses as right parentheses. Since (/?, i.e. (-la), has one more 
left parenthesis and one more right parentheses than a, it must 
have just as many left parentheses as right parentheses as well. 

(2) By the induction hypothesis, /3 and 7 each have the same 
number of left parentheses as right parentheses. Since ip, i.e. 
{j3 q), has one more left parenthesis and one more right 
parnthesis than [3 and 7 together have, it must have just as 
many left parntheses as right parentheses as well. 

17 



18 HINTS FOR CHAPTERS 1-4 

It follows by induction that every formula ip oi Cp has just as many 
left parentheses as right parentheses. 

□ 

1.3. Compute p{a)/i{a) for a number of examples and look for 
patterns. Getting a minimum value should be pretty easy. 

1.4. Proceed by induction on the length of or on the number of 
connectives in the formula. 

1.5. Construct examples of formulas of all the short lengths that 
you can, and then see how you can make longer formulas out of short 
ones. 

1.6. Hewlett-Packard sells calculators that use such a trick. A sim- 
ilar one is used in Definition 5.2. 

1.7. Observe that Cp has countably many symbols and that every 
formula is a finite sequence of symbols. The relevant facts from set 
theory are given in Appendix A. 

1.8. Stick several simple statements together with suitable connec- 
tives. 

1.9. This should be straightforward. 

1.10. Ditto. 

1.11. To make sure you get all the subformulas, write out the for- 
mula in official form with all the parentheses. 

1.12. Proceed by induction on the length or number of connectives 
of the formula. 

Hints for Chapter 2. 

2.1. Use truth tables. 

2.2. Proceed by induction on the length of S or on the number of 
connectives in 5. 

2.3. Use Proposition 2.2. 

2.4. In each case, unwind Definition 2.1 and the definitions of the 
abbreviations. 

2.5. Use truth tables. 

2.6. Use Definition 2.3 and Proposition 2.4. 

2.7. If a truth assignment satisfies every formula in E and every 
formula in P is also in E, then. . . 
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2.8. Grinding out an appropriate truth table will do the job. Why 
is it important that E be finite here? 

2.9. Use Definition 2.4 and Proposition 2.4. 

2.10. Use Definitions 2.3 and 2.4. If you have trouble trying to 
prove one of the two directions directly, try proving its contrapositive 
instead. 

Hints for Chapter 3. 

3.1. Truth tables are probably the best way to do this. 

3.2. Look up Proposition 2.4. 

3.3. There are usually many different deductions with a given con- 
clusion, so you shouldn't take the following hints as gospel. 

(1) Use A2 and Al. 

(2) Recall what V abbreviates. 

3.4. You need to check that ({)\. . .(pi satisfies the three conditions 
of Definition 3.3; you know (p\. . .(p^ does. 

3.5. Put together a deduction of /3 from F from the deductions of 
b and 5 ^ (3 from P. 

3.6. Examine Definition 3.3 carefully. 

3.7. The key idea is similar to that for proving Proposition 3.5. 

3.8. One direction follows from Proposition 3.5. For the other di- 
rection, proceed by induction on the length of the shortest proof of /3 
fi:om EU {q;}. 

3.9. Again, don't take these hints as gospel. Try using the Deduc- 
tion Theorem in each case, plus 

(1) A3. 

(2) A3 and Problem 3.3. 

(3) A3. 

(4) A3, Problem 3.3, and Example 3.2. 

(5) Some of the above parts and Problem 3.3. 

(6) Ditto. 

(7) Use the definition of V and one of the above parts. 

(8) Use the definition of A and one of the above parts. 

(9) Aim for ->a — > (a — > as an intermediate step. 
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Hints for Chapter 4. 

4.1. Use induction on the length of the deduction and Proposition 

3.2. 

4.2. Assume, by way of contradiction, that the given set of formulas 
is inconsistent. Use the Soundness Theorem to show that it can't be 
satisfiable. 

4.3. First show that {-'{a a)} h ■0- 

4.4. Note that deductions are finite sequences of formulas. 

4.5. Use Proposition 4.4. 

4.6. Use Proposition 4.2, the definition of E, and Proposition 2.4. 

4.7. Assume, by way of contradiction, that 99 ^ S. Use Definition 
4.2 and the Deduction Theorem to show that E must be inconsistent. 

4.8. Use Definition 4.2 and Problem 3.9. 

4.9. Use Definition 4.2 and Proposition 4.8. 

4.10. Use Proposition 1.7 and induction on a list of all the formulas 
of Cp. 

4.11. One direction is just Proposition 4.2. For the other, expand 
the set of formulas in question to a maximally consistent set of formulas 

E using Theorem 4.10, and define a truth assignment v by setting 
v{An) = T if and only if An G S. Now use induction on the length of 
ip to show that (/? G E if and only if v satisfies 

4.12. Prove the contrapositive using Theorem 4.11. 

4.13. Put Corollary 4.5 together with Theorem 4.11. 
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CHAPTER 5 



Languages 

As noted in the Introduction, propositional logic has obvious defi- 
ciencies as a tool for mathematical reasoning. First-order logic remedies 
enough of these to be adequate for formalizing most ordinary mathe- 
matics. It does have enough in common with propositional logic to let 
us recycle some of the material in Chapters 1-4. 

A few informal words about how first-order languages work are in 
order. In mathematics one often deals with structures consisting of 
a set of elements plus various operations on them or relations among 
them. To cite three common examples, a group is a set of elements 
plus a binary operation on these elements satisfying certain conditions, 
a field is a set of elements plus two binary operations on these elements 
satisfying certain conditions, and a graph is a set of elements plus a 
binary relation with certain properties. In most such cases, one fre- 
quently uses symbols naming the operations or relations in question, 
symbols for variables which range over the set of elements, symbols 
for logical connectives such as not and for all, plus auxiliary symbols 
such as parentheses, to write formulas which express some fact about 
the structure in question. For example, if (G, ■) is a group, one might 
express the associative law by writing something like 

yx^y^z X ■ {y ■ z) — {x ■ y) ■ z , 

it being understood that the variables range over the set G of group 
elements. A formal language to do as much will require some or all of 
these: symbols for various logical notions and for variables, some for 
functions or relations, plus auxiliary symbols. It will also be necessary 
to specify rules for putting the symbols together to make formulas, for 
interpreting the meaning and determining the truth of these formulas, 
and for making inferences in deductions. 

For a concrete example, consider elementary number theory. The 
set of elements under discussion is the set of natural numbers N = 
{ 0, 1, 2, 3, 4, . . . }. One might need symbols or names for certain inter- 
esting numbers, say and 1; for variables over N such as n and x; for 
functions on N, say ■ and -|-; and for relations, say =, <, and |. In 
addition, one is likely to need symbols for punctuation, such as ( and 
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); for fogical connectives, such as -> and — >; and for quantifiers, such 
as V ( "for all" ) and 3 ( "there exists" ) . A statement of mathematical 
English such as "For all n and m, if n divides m, then n is less than or 
equal to m" can then be written as a cool formula like 

VnVm {n \ m ^ {n < m /\n — m)) . 

The extra power of first-order logic comes at a price: greater com- 
plexity. First, there are many first-order languages one might wish to 
use, practically one for each subject, or even problem, in mathematics.^ 
We will set up our definitions and general results, however, to apply to 
a wide range of them.^ 

As with £p, our formal language for prepositional logic, first-order 
languages are defined by specifying their symbols and how these may 
be assembled into formulas. 

Definition 5.1. The symbols of a first-order language C include: 

(1) Parentheses: ( and ). 

(2) Connectives: -i and — >. 

(3) Quantifier: V. 

(4) Variables: Vq, Vi, V2, . . . , Vn, ... 

(5) Equality: =. 

(6) A (possibly empty) set of constant symbols. 

(7) For each k > 1, a (possibly empty) set of k-place function 
symbols. 

(8) For each /c > 1, a (possibly empty) set of k-place relation (or 
predicate) symbols. 

The symbols described in parts 1-5 are the logical symbols of C, shared 
by every first-order language, and the rest are the non-logical symbols 
of C, which usually depend on what the language's intended use. 

Note. It is possible to define first-order languages without =, so = 
is considered a non-logical symbol by many authors. While such lan- 
guages have some uses, they are uncommon in ordinary mathematics. 

Observe that any first-order language C has countably many logical 
symbols. It may have uncountably many symbols if it has uncountably 
many non-logical symbols. Unless explicitly stated otherwise, we will 



It is possible to formalize almost all of mathematics in a single first-order 

language, like that of set theory or category theory. However, trying to actually do 
most mathematics in such a language is so hard as to be pointless. 

^Specifically, to countable one-sorted first-order languages with equality. 
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assume that every first-order language we encounter has only count- 
ably many non-logical symbols. Most of the results we will prove actu- 
ally hold for countable and uncountable first-order languages alike, but 
some require heavier machinery to prove for uncountable languages. 

Just as in Cp, the parentheses are just punctuation while the con- 
nectives, -1 and — are intended to express not and if . . . then. How- 
ever, the rest of the symbols are new and are intended to express ideas 
that cannot be handled by Cp. The quantifier symbol, V, is meant to 
represent for all, and is intended to be used with the variable symbols, 
e.g. Vf4. The constant symbols are meant to be names for particular 
elements of the structure under discussion, fc-place function symbols 
are meant to name particular functions which map /c-tuples of elements 
of the structure to elements of the structure, fc-place relation symbols 
are intended to name particular fc-place relations among elements of 
the structure.^ Finally, = is a special binary relation symbol intended 
to represent equality. 

Example 5.1. Since the logical symbols are always the same, first- 
order languages are usually defined by specifying the non-logical sym- 
bols. A formal language for elementary number theory like that unof- 
ficially described above, call it Cnt, can be defined as follows. 

• Constant symbols: and 1 

• Two 2-place function symbols: + and • 

• Two binary relation symbols: < and | 

Each of these symbols is intended to represent the same thing it does 
in informal mathematical usage: and 1 arc intended to be names 
for the numbers zero and one, + and ■ names for the operations of 
addition and multiplications, and < and | names for the relations "less 
than" and "divides" . (Note that we could, in principle, interpret things 
completely differently - let represent the number forty-one, -|- the 
operation of exponentiation, and so on - or even use the language to 
talk about a different structure - say the real numbers, M, with 0, 
1, +, •, and < representing what they usually do and, just for fun, 
I interpreted as "is not equal to". More on this in Chapter 6.) We 
will usually use the same symbols in our formal languages that we use 
informally for various common mathematical objects. This convention 

"^Intuitively, a relation or predicate expresses some (possibly arbitrary) relation- 
ship among one or more objects. For example, "n is prime" is a 1-place relation 
on the natural numbers, < is a 2-place or binary relation on the rationals, and 
ax(&xc) = Oisa 3- place relation on M^. Formally, a /c-place relation on a set X 
is just a subset of X*^, i.e. the collection of sequences of length k of elements of X 
for which the relation is true. 



26 



5. LANGUAGES 



can occasionally cause confusion if it is not clear whether an expression 
involving these symbols is supposed to be an expression in a formal 
language or not. 

Example 5.2. Here are some other first-order languages. Recall 

that we need only specify the non-logical symbols in each case and 
note that some parts of Definitions 5.2 and 5.3 may be irrelevant for 
a given language if it is missing the appropriate sorts of non-logical 
symbols. 

(1) The language of pure equality, £=: 

• No non-logical symbols at all. 

(2) A language for fields, Cp'- 

• Constant symbols: 0, 1 

• 2-place function symbols: -|-, • 

(3) A language for set theory, Cs'- 

• 2-place relation symbol: G 

(4) A language for linear orders, Co'- 

• 2-place relation symbol: < 

(5) Another language for elementary number theory, Cn'- 

• Constant symbol: 

• 1-place function symbol: S 

• 2-place function symbols: -|-, •, £^ 

Here is intended to represent zero, -S" the successor func- 
tion, i.e. S{n) ~ n-\-l, and E the exponential function, i.e. 
E{n, m) = n™". 

(6) A "worst-case" countable language, C^: 

• Constant symbols: ci, C2, C3, ... 

• For each k > 1. A;-place function symbols: /f , /|", /|, ... 

• For each k > 1, fc-place relation symbols: , P2, P^, ■ ■ ■ 
This language has no use except as an abstract example. 

It remains to specify how to form valid formulas from the symbols 
of a first-order language £. This will be more complicated than it was 
for Cp. In fact, we first need to define a type of expression in C which 
has no counterpart in propositional logic. 

Definition 5.2. The terms of a first-order language C are those 
finite sequences of symbols of C which satisfy the following rules: 

(1) Every variable symbol w„ is a term. 

(2) Every constant symbol c is a term. 

(3) If / is a /c-place function symbol and ti, . . . ,tk are terms, then 
fti . . .tk is also a term. 

(4) Nothing else is a term. 
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That is, a term is an expression which represents some (possibly 
indeterminate) element of the structure under discussion. For example, 
in /^ATT or £Ar, +voVi (informally, vq + vi ) is a term, though precisely 
which natural number it represents depends on what values are assigned 

to the variables vq and Vi. 

Problem 5.1. Which of the following are terms of one of the lan- 
guages defined in Examples 5.1 and 5.2? If so, which of these lan- 
guage(s) are they terms of; if not, why not? 

(1) ■V2 

(2) +0 ■ +vell 

(3) \l+vsO 

(4) (< EWl +11) 

(5) + + • + 00000 

(6) /I/7C4V9C1V4 

(7) M+'^vs) 

(8) < V6V2 

(9) 1 + 

Note that in languages with no function symbols all terms have 
length one. 

Problem 5.2. Choose one of the languages defined in Examples 
5.1 and 5.2 which has terms of length greater than one and determine 
the possible lengths of terms of this language. 

Proposition 5.3. The set of terms of a countable first-order lan- 
guage C is countable. 

Having defined terms, we can finally define first-order formulas. 

Definition 5.3. The formulas of a first-order language C are the 
finite sequences of the symbols of C satisfying the following rules: 

(1) If P is a /c-place relation symbol and ii, . . . ,tk are terms, then 
Pti . . . tfc is a formula. 

(2) If ti and ^2 are terms, then = tit2 is a formula. 

(3) If q; is a formula, then (-la) is a formula. 

(4) If a and (3 are formulas, then [a ^ (3) is a formula. 

(5) If y9 is a formula and f „ is a variable, then Vv„(^ is a formula. 

(6) Nothing else is a formula. 

Formulas of form 1 or 2 will often be referred to as the atomic formulas 
of C. 

Note that three of the conditions in Definition 5.3 are borrowed 
directy from propositional logic. As before, we will exploit the way 
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formulas are built up in making definitions and in proving results by 
induction on the length of a formula. We will also recycle the use 
of lower-case Greek characters to refer to formulas and of upper-case 
Greek characters to refer to sets of formulas. 

Problem 5.4. Which of the following are formulas of one of the 
languages defined in Examples 5.1 and 5.2? If so, which of these lan- 
guage(s) are they formulas of; if not, why not? 



(1) 


= Q + V-j ■ lV2, 




(2) 


(-, = ViVi) 




(3) 


(baO ^ -01) 




(4) 






(5) 


< +01|wiU3 




(6) 


(^3 = ^ Vt^s 




(7) 


Vv6(= veO Vt'9(- 




(8) 


Vvs < +11^4 





Problem 5.5. Show that every formula of a first-order language 
has the same number of left parentheses as of right parentheses. 

Problem 5.6. Choose one of the languages defined in Examples 5.1 
and 5.2 and determine the possible lengths of formulas of this language. 

Proposition 5.7. A countable first-order language C has countably 
many formulas. 

In practice, devising a formal language intended to deal with a par- 
ticular (kind of) structure isn't the end of the job: one must also specify 
axioms in the language that the structure (s) one wishes to study should 
satisfy. Defining satisfaction is officially done in the next chapter, but 
it is usually straightforward to unofficially figure out what a formula 
in the language is supposed to mean. 

Problem 5.8. In each case, write down a formula of the given 
language expressing the given informal statement. 

(1) ''Addition is associative" in Cp- 

(2) "There is an empty set" in Cg- 

(3) "Between any two distinct elements there is a third element" 
in Co- 

(4) = 1 for every n different from " in Cn- 

(5) "There is only one thing" in £=. 

Problem 5.9. Define first-order languages to deal with the follow- 
ing structures and, in each case, an appropriate set of axioms in your 
language: 
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(1) Groups. 

(2) Graphs. 

(3) Vector spaces. 

We will need a few additional concepts and facts about formulas of 
first-order logic later on. First, what are the subformulas of a formula? 

Problem 5.10. Define the set of subformulas of a formula (p of a 
first-order language L. 

For example, if </? is 

(((-iVt;i (-1 = vxci)) Piv5Vs) yvsi^ vgflcoViVs -P2 ^^s)) 

in the language Ci, then the set of subformulas of (f, S{(f), ought to 
include 

• = V1C7, Piv5V8, = VsficoViVr,, P2V8, 

• (-, = V1C7), (= VsficoViVn P^Vs), 

• ^Vi (-1 = V1C7), yV8{= VfiflcoViV^ Pivs), 

• {-^yVi (-1 = V1C7)), 

• {-^^vi = V1C7)) P^V5Vs), and 

• (((-iVwi (-. = V1C7)) Pivsvg) \fvs{= VsficoViV5 P2V8)) 

itself. 

Second, we will need a concept that has no counterpart in preposi- 
tional logic. 

Definition 5.4. Suppose a; is a variable of a first-order language 
jC. Then x occurs free in a formula (/? of £ is defined as follows: 

(1) If (/? is atomic, then x occurs free in (f if and only if x occurs 
in (f. 

(2) If ifi is {-^a), then x occurs free in ip if and only if x occurs free 
in a. 

(3) If </? is (/3 — > 5), then x occurs free in (f if and only if x occurs 

free in ,/? or in S. 

(4) If (/9 is Vf Ip, then x occurs free in ip if and only if x is different 
from Vk and x occurs free in tjj. 

An occurrence of x imp which is not free is said to be bound. A formula 
(7 of £ in which no variable occurs free is said to be a sentence. 

Part 4 is the key: it asserts that an occurrence of a variable x 
is bound instead of free if it is in the "scope" of an occurrence of 
Vx. For example, V7 is free in \fv^ = v^vr, but is not. Different 
occurences of a given variable in a formula may be free or bound, 
depending on where they are; e.g. vq occurs both free and bound in 



30 



5. LANGUAGES 



Problem 5.11. Give a precise definition of the scope of a quanti- 
fier. 

Note the distinction between sentences and ordinary formulas intro- 
duced in the last part of Definition 5.4. As we shall see, sentences are 
often more tractable and useful theoretically than ordinary formulas. 

Problem 5.12. Which of the formulas you gave in solving Prob- 
lem 5.8 are sentences? 

Finally, we will eventually need to consider a relationship between 
first-order languages. 

Definition 5.5. A first-order language C is an extension of a first- 
order language C, sometimes written as £ C C', if every non-logical 
symbol of >C is a non-logical symbol of the same kind of C. 

For example, every first-order language is an extension of C=. 

Problem 5.13. Which of the languages given in Example 5.2 are 
extensions of other languages given in Example 5.2? 

Proposition 5.14. Suppose C is a first-order language and CJ is 
an extension of L. Then every formula (p of C is a formula of C . 

Common Conventions. As with propositional logic, we will often 
use abbreviations and informal conventions to simplify the writing of 
formulas in first-order languages. In particular, we will use the same 
additional connectives we used in propositional logic, plus an additional 
quantifier, 3 ("there exists"): 

• (a A /9) is short for (-■(« 

• (a V /3) is short for ((-la) P). 

• {a ^ P) is short for ((« ^ /3) A{P ^ a)). 

• 3vk<f is short for (-iVwfc(-i</?)). 

(V is often called the universal quantifier and 3 is often called the 
existential quantifier.) 

Parentheses will often be omitted in formulas according to the same 
conventions we used in propositional logic, with the modification that 
V and 3 take precedence over all the logical connectives: 

• We will usually drop the outermost parentheses in a formula, 

writing a j3 instead of {a — > /3) and -la instead of (~iQ;). 

• Wc will let V take precedence over -i, and -> take precedence 
over when parentheses arc missing, and fit the informal ab- 
breviations into this scheme by letting the order of precedence 
be V, 3, -1, A, V, — >, and 



5. LANGUAGES 



31 



• Finally, we will group repetitions of — V, A, or <-> to the 
right when parentheses are missing, so a — > /3 — > 7 is short for 
(a^(/3^7))- 

For example, 3vk^a WvnP is short for {{-i\/vk{^{^a))) Vf„/9). 
On the other hand, we will sometimes add parentheses and arrange 
things in unofficial ways to make terms and formulas easier to read. In 
particular we will often write 

(1) f{ti, . . . ,tk) for fti . . . ifc if / is a k-place function symbol and 

ti, ■ ■ ■ , tk are terms, 

(2) s o t for ost if o is a 2-place function symbol and s and t are 

terms, 

(3) . . . , tfc) for Pti . . . ifc if P is a /c-place relation symbol and 
ti, . . . , tfc arc terms, 

(4) s • t for wst if • is a 2-place relation symbol and s and t are 

terms, and 

(5) s — t for — st ii s and t are terms, and 

(6) enclose terms in parentheses to group them. 

Thus, we could write the formula = +1 • Ovq ■ 11 oi jCnt StS 1 + {0 ■ vq) — 
1 ■ 1. 

As was observed in Example 5.1, it is customary in devising a formal 
language to recycle the same symbols used informally for the given 
objects. In situations where we want to talk about symbols without 
committing ourselves to a particular one, such as when talking about 
first-order languages in general, we will often use "generic" choices: 

• a, b, c, . . . for constant symbols; 

• X, y, z, . . . for variable symbols; 

• /, g, h, ... for function symbols; 

• P, Q, -R, ... for relation symbols; and 

• r, s, t, . . . for generic terms. 

These can be thought of as variables in the metalanguage^ ranging over 
different kinds objects of first-order logic, much as we're already using 
lower-case Greek characters as variables which range over formulas. (In 
fact, we have already used some of these conventions in this chapter. . . ) 

Unique Readability. The slightly paranoid might ask whether 
Definitions 5.1, 5.2 and 5.3 actually ensure that the terms and formulas 
of a first-order language C are unambiguous, i.e. cannot be read in 

^The metalanguage is the language, mathematical EngUsh in this case, in which 

wc talk about a language. The theorems we prove about formal logic are, strictly 
speaking, metatheorems, as opposed to the theorems proved within a formal logical 
system. For more of this kind of stuff, read some philosophy. . . 
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more than one way. As with i2p, to actually prove this one must 
assume that all the symbols of L are distinct and that no symbol is a 
subsequence of any other symbol. It then follows that: 

Theorem 5.15. Any term of a first-order language C satisfies ex- 
actly one of conditions 1-3 in Definition 5.2. 

Theorem 5.16 (Unique Readability Theorem). Any formula of a 
first-order language satisfies exactly one of conditions 1-5 in Definition 
5.3. 



CHAPTER 6 



Structures and Models 

Defining truth and implication in first-order logic is a lot harder 
than it was in propositional logic. First-order languages are intended 
to deal with mathematical objects like groups or linear orders, so it 
makes little sense to speak of the truth of a formula without specifying 
a context. For example, one can write down a formula expressing the 
commutative law in a language for group theory, \/x\/yx ■ y — y ■ x, 
but whether it is true or not depends on which group we're dealing 
with. It follows that we need to make precise which mathematical 
objects or structures a given first-order language can be used to discuss 
and how, given a suitable structure, formulas in the language are to 
be interpreted. Such a structure for a given language should supply 
most of the ingredients needed to interpret formulas of the language. 
Throughout this chapter, let C be an arbitrary fixed countable first- 
order language. All formulas will be assumed to be formulas of £ unless 
stated otherwise. 

Definition 6.1. A structure dJl for C consists of the following: 

(1) A non-empty set M, often written as |9Jt|, called the universe 

of m. 

(2) For each constant symbol c of C, an element of M. 

(3) For each fc-place function symbol f of £, a function : 
M'' — > M, i.e. a fc-place function on M. 

(4) For each /c-place relation symbol P of C, a relation C M'^, 
i.e. a k-place relation on M. 

That is, a structure supplies an underlying set of elements plus in- 
terpretations for the various non-logical symbols of the language: con- 
stant symbols are interpreted by particular elements of the underlying 
set, function symbols by functions on this set, and relation symbols by 
relations among elements of this set. 

It is customary to use upper-case "gothic" characters such as Wl 
and 9^ for structures. 

For example, consider 13 = (Q, <), where < is the usual "less than" 
relation on the rationals. This is a structure for Co, the language for 
linear orders defined in Example 5.2; it supplies a 2-place relation to 
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interpret the language's 2-place relation symbol. is not the only 
possible structure for £o' i^, <), ({0}, 0), and (N, N^) are three others 
among infinitely many more. (Note that in these cases the relation 
symbol < is interpreted by relations on the universe which are not 
linear orders. One can ensure that a structure satisfy various condi- 
tions beyond what Definition 6.1 guarantees by requiring appropriate 
formulas to be true when interpreted in the structure.) On the other 
hand, (R) is not a structure for Co because it lacks a binary relation 
to interpret the symbol < by, while (N, 0, 1, +, •, |, <) is not a structure 
for Co because it has two binary relations where Cq has a symbol only 
for one, plus constants and functions for which Cq lacks symbols. 

Problem 6.1. The first-order languages referred to below were all 
defined in Example 5.2. 

(1) Is (0) a structure for C= ? 

(2) Determine whether O. = (Q, <) is a structure for each of C=, 
Cp, and Cs. 

(3) Give three different structures for Cp which are not fields. 

To determine what it means for a given formula to be true in a 
structure for the corresponding language, we will also need to specify 
how to interpret the variables when they occur free. (Bound variables 
have the associated quantifier to tell us what to do.) 

Definition 6.2. Let V = {vq, vi, ^2, • • • } be the set of all variable 
symbols of C and suppose 9Jt is a structure for C. A function s -.V ^ 
1 971 1 is said to be an assignment for 971. 

Note that these are not truth assignments like those for Cp. An 
assignment just interprets each variable in the language by an element 
of the universe of the structure. Also, as long as the universe of the 
structure has more than one element, any variable can be interpreted 
in more than one way. Hence there are usually many different possible 
assignments for a given structure. 

Example 6.1. Consider the structure 91 = (M, 0, 1,+,-) for Cp. 
Each of the following functions y — > R is an assignment for 91: 

(1) p(iVi) = TT for each n, 

(2) r{vn) = for each n, and 

(3) s{vn) = n + 1 for each n. 

In fact, every function V ^ IR is an assignment for 9i. 

In order to use assignments to determine whether formulas are true 
in a structure, we need to know how to use an assignment to interpret 
each term of the language as an element of the universe. 
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Definition 6.3. Suppose 9Jt is a structure for L and s:V ^ |3Jt| 
is an assignment for Let T be the set of all terms of L. Then the 
extended assignment s: T — > is defined inductively as follows: 

(1) For each variable s{x) = s{x). 

(2) For each constant symbol c, s(c) = c^. 

(3) For every A';-placc function symbol / and terms ti, . . . , tk, 

S(^i...tfe) = /™(s(ti),...,s(tfe)). 

Example 6.2. Let 9^ be the structure for given in Example 
6.1, and let p, r, and s be the extended assignments corresponding to 
the assignments p, r, and s defined in Example 6.1. Consider the term 
+ ■ vqVq + 0v3 of Cp. Then: 

(1) p(+ • VqVo + OVs) = TT^ + TT, 

(2) r(+ • vqVo + OV3) = + and 

(3) S(+ • VqVo + OV3) = 11. 

Here's why for the last one: since s{vq) = 7, s{vo) — 1, s{v3) = 4, 
and s(0) = (by part 2 of Definition 6.3), it follows from part 3 of 
Definition 6.3 that s(+ ■ vqVq + OV3) = (7 ■ 1) + (0 + 4) = 7 + 4 = 11. 

Problem 6.2. *H = {N,0, S,+,-,E) is a structure for Cn- Let 
s: V — >• N 6e the assignment defined by s{vk) = k + 1. What are 
s(E + vigvi ■ OV45) and s(SSS + EOveVj) ? 

Proposition 6.3. s is unique, i.e. given an assignment s, no other 
function T ^ \dJH\ satisfies conditions 1-3 in Definition 6.3. 

With Definitions 6.2 and 6.3 in hand, we can take our first cut at 
defining what it means for a first-order formula to be true. 

Definition 6.4. Suppose 97t is a structure for £, s is an assignment 
for 971, and 99 is a formula of C. Then 9Jt |= ip[s\ is defined as follows: 

(1) If Lp is t\ = t2 for some terms ti and ^2, then 9Jt |= Lp[s\ if and 
only if s{ti) — 8(^2)- 

(2) If (p is Pti . . .tk for some /c-place relation symbol P and terms 
ti, . . . , tk, then 9Jt |= (p[s] if and only if (s(ii), . . . , s{tk)) G P™, 
i.e. is true of (s(ti), . . . , s{tk)). 

(3) If (p is {^ip) for some formula ip, then DJl \= (p[s] if and only if 
it is not the case that 9Jt |= ip[s]. 

(4) If V? is (a ^ /?), then TI \= ^[s] if and only if M \= f3[s] 
whenever Tt \= a[s], i.e. unless DJl \= a[s] but not DJl \= f3[s]. 

(5) If ip is Va; 6 for some variable x, then 3Jt |= ip[s] if and only if for 
all m e |9Jt|, Wl 1= 5[s(a;|m)], where s{x\m) is the assignment 
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given by 

s{vk) if Vk is different from x 
m if Vk is X. 



s{x\m){vk) 



If 971 1= ip[s], we shall say that 9Jt satisfies if on assignment s or that 
ip is true in 9Jl on assignment s. We will often write 9Jti^ ip[s] if it is 
not the case that Wt \= (p[s]. Also, if F is a set of formulas of £, we 
shall take dJl \= r[s] to mean that 3Jl |= j[s] for every formula 7 in F 
and say that 9?t satisfi.es F on assignment s. Similarly, we shall take 
9Jti^ F[s] to mean that 971 7[s] for some formula 7 in F. 

Clauses 1 and 2 are pretty straightforward and clauses 3 and 4 are 
essentially identical to the corresponding parts of Definition 2.1. The 
key clause is 5, which says that V should be interpreted as "for all 
elements of the universe" . 

Example 6.3. Let 91 be the structure for Cp and s the assignment 
for 9^ given in Example 6.1, and consider the formula Vf 1 (= v^-Ovi 
V3O) of Cp- We can verify that 91 |= Vvi (= ^3 • Ovi —>■= V3O) [s] as 
follows: 

91 ^ V^;i (= V3 ■ Ovi ^= ^;30) [s] 
<^ for all a e |9l|, 91 h (= • 0^;i ^= V3O) [s(vi\a)] 
<(=^ for all ae |9l|, if 91 1== V3 ■ Ovi [s{vi\a)], 

then 91 h= 't;30[s(T;i|a)] 
<^=^ for all a e |9l|, if s{vi\a){v3) — s('i;i|a)(-Oi'i), 

then s{vi\a){v3) — s(fi|a)(0) 

<(=^ for all a e |9l|, if s{v3) — s(i'i|a)(0) • s{vi\a){vi), then s{v3) — 

<(=^ for all a e |9l|, if s{v3) — • a, then s{v3) — 

<^ for all a e |9l|, if 4 = • a, then 4 = 

<^ for all a e |9l|, if 4 = 0, then 4 = 

. . . which last is true whether or not 4 = is true or false. 

Problem 6.4. Let^ be the structure for £n in Problem 6.2. Let 
p : y — > N be defined by p{v2k) — k and p{v2k+i) — k. Verify that 

(1) 91 1= \/w {^Sw = 0) [p] and 

(2) mi^yx3yx + y ^0\p]. 

Proposition 6.5. Suppose DJl is a structure for C, s is an as- 
signment for 971, X is a variable, and ip is a formula of a first- order 
language C. Then 971 |= 3a;9?[s] if and only ifDJl \= ip[s{x\m)] for some 
m e |97l|. 
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Working with particular assignments is difficult but, while some- 
times unavoidable, not always necessary. 

Definition 6.5. Suppose 9Jt is a structure for £, and a formula 
of C Then 3Jl ^ </? if and only if OJl |= for every assignment 
s : V ^ |OJl| for OJl. 9Jl is a model of ^9 or that is true in OJt if 
9JI 1= (/?. We will often write 9K if it is not the case that 9Jt |= 

Similarly, if F is a set of formulas, we will write 9Jt |= F if 9Jl |= 7 
for every formula 7 G F, and say that 3Jl is a model of F or that 2Jl 
satisfies F. A formula or set of formulas is satis fiable if there is some 
structure OJt which satisfies it. We will often write 9Jl F if it is not 
the case that 9Jt |= F. 

Note. OJt (/? does not mean that for every assignment s : V ^ 
|9Jt|, it is not the case that 9Jt |= (p[s]. It only means that that there is 
some assignment r : V ^ for which 3Jt |= ip[r] is not true. 

Problem 6.6. = (Q, <) is a structure for Co- For each of the 
following formulas ip of Co, determine whether or not £} |= (p. 

(1) \/Vo^V2Vo < V2 

(2) 3vi {Vi <V3^Vi^ V3) 

(3) Vf4Vf5 Vf6(f4 <V5 ^ (f5 < f 6 ^ f 4 < Vq)) 

The following facts are counterparts of sorts for Proposition 2.2. 
Their point is that what a given assignment does with a given term or 
formula depends only on the assignment's values on the (free) variables 
of the term or formula. 

Lemma 6.7. Suppose DJl is a structure for C, t is a term of C, and 
r and s are assignments forDJl such that r{x) = s{x) for every variable 
X which occurs in t. Then r(t) = s{t). 

Proposition 6.8. Suppose 0?t a structure for C, (p is a formula 
of C, and r and s are assignments for 3Jt such that r{x) — s{x) for 
every variable x which occurs free in </?. Then 9Jl |= </?[r] if and only if 

Corollary 6.9. Suppose 9Jl is a structure for C and a is a sen- 
tence of C. Then 9K |= cr z/ and only if there is some assignment 
s:V ^\'>m\ for m such that mt |= a[s\. 

Thus sentences are true or false in a structure independently of any 
particular assignment. This docs not necessarily make life easier when 
trying to verify whether a sentence is true in a structure - try doing 
Problem 6.6 again with the above results in hand - but it does let us 
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simplify things on occasion when proving things about sentences rather 
than formulas. 

We recycle a sense in which we used |= in propositional logic. 

Definition 6.6. Suppose F is a set of formulas of L and ^0 is a 

formula of C Then F implies ip, written as F |= if 3Jl |= whenever 
9Jt 1= F for every structure 9Jt for C. 

Similarly, if F and A are sets of formulas of then F implies A, 
written asF|=A, ifnJl|=A whenever 2Jl |= F for every structure 3Jl 
for C 

We will usually write |= . . . for |= 

Proposition 6.10. Suppose a and /? are formulas of some first- 
order language. Then { (a — > a } |= 

Proposition 6.11. Suppose H is a set of formulas and ip and p 
are formulas of some first-order language. Then S U \= p if and 
only ifT. |= {i) p). 

Definition 6.7. A formula of >C is a tautology if it is true in 
every structure, i.e. if |= ip. ^ is a contradiction if is a tautology, 
i.e. if ^ 

For some trivial examples, let be a formula of C and 3Jl a structure 
for C. Then 9Jt |= {93} if and only if OK |= so it must be the case 
that {if} 1= v^. It is also easy to check that ^ is a tautology and 
— > is a contradiction. 

Problem 6.12. Show that\/yy — y is a tautology and that ^y->y — 

y is a contradiction. 

Problem 6.13. Suppose tp is a contradiction. Show that 9Jt |= ip[s] 
is false for every structure OJl and assignment s -.V ^ |3Jl| for OJt. 

Problem 6.14. Show that a set of formulas E is satisfiable if and 
only if there is no contradiction x such that E |= x- 

The following fact is a counterpart of Proposition 2.4. 

Proposition 6.15. Suppose d3l is a structure for C and a and {3 
are sentences of C. Then: 

(1) 9K 1= -iQ; if and only ifVJlJi^ a. 

(2) 9Jt 1= a /3 if and only ifDJt\=/3 whenever 3Jl ^ ct. 

(3) m\=ay f3 if and only OTt |= a or M |= /3. 

(4) mt 1= a A /3 if and only ?/ 9Jt |= a and |= 

(5) 9Jl 1= q; ^ /3 if and only i/9Jl |= a exactly when 9Jl |= {3. 

(6) 9Jl 1= Vx a if and only z/ 3Jl |= a. 
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(7) Tl 1= 3x a if and only if there is some m G |9Jt| so that 9Jt |= 
a [s(x|m)] for every assignment s for VJl. 

Problem 6.16. How much of Proposition 6.15 must remain true 
if a and (5 are not sentences? 

Recall that by Proposition 5.14 a formula of a first-order language 
is also a formula of any extension of the language. The following rela- 
tionship between extension languages and satisfiability will be needed 
later on. 

Proposition 6.17. Suppose C is a first-order language, C is an 
extension of C, and T is a set of formulas of C. Then T is satisfiahle 
in a structure for C if and only ifV is satisfiahle in a structure for CJ . 

One last bit of terminology. . . 

Definition 6.8. If 971 is a structure for £, then the theory of 9Jt is 
just the set of all sentences of £ true in 9Jt, i.e. 

Th(97l) = { r I r is a sentence and 9Jt |= r }. 

If A is a set of sentences and 5 is a collection of structures, then A is 
a set of (non-logical) axioms for S if for every structure 3Jl, 3Jl e <S if 
and only if 9Jt ^ A. 

Example 6.4. Consider the sentence 3x3y{{->x — y) /\'^z{z — 
xV z — y)) of C=. Every structure of jC= satisfying this sentence must 
have exactly two elements in its universe, so { 3x 3y {{-'X = y) AV2; {z = 
a; V 2; = y)) } is a set of non-logical axioms for the collection of sets of 
cardinality 2: 

{ 9Jt I 9Jl is a structure for C= with exactly 2 elements } . 

Problem 6.18. In each case, find a suitable language and a set of 
axioms in it for the given collection of structures. 

(1) Sets of size 3. 

(2) Bipartite graphs. 

(3) Commutative groups. 

(4) Fields of characteristic 5. 
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Deductions 

Deductions in first-order logic are not unlike deductions in propo- 
sitional logic. Of course, some changes are necessary to handle the 
various additional features of propositional logic, especially quantifiers. 
In particular, one of the new axioms requires a tricky preliminary def- 
inition. Roughly, the problem is that we need to know when we can 
replace occurrences of a variable in a formula by a term without letting 
any variable in the term get captured by a quantifier. 

Throughout this chapter, let £ be a fixed arbitrary first-order lan- 
guage. Unless stated otherwise, all formulas will be assumed to be 
formulas of L. 

Definition 7.1. Suppose x is a variable, t is a term, and </? is a 
formula. Then t is substitutable for x in (f is defined as follows: 

(1) If is atomic, then t is substitutable for x in (f. 

(2) If (p is (-'V')) then t is substitutable for x in if and only if t 
is substitutable for x in ip. 

(3) If ip is {a ^ P), then t is substitutable for x in cp ii and only 
if t is substitutable for x in a and t is substitutable for x in (3. 

(4) U ip is \/yS, then t is substitutable for x in if and only if 
either 

(a) x docs not occur free in ip, or 

(b) if y does not occur in t and t is substitutable for x in 5. 

For example, x is always substitutable for itself in any formula 
(f and (p^^ is just ip (see Problem 7.1). On the other hand, y is not 
substitutable for x in^yx — y because if x were to be replaced by y, 
the new instance of y would be "captured" by the quantifier \/y. This 
makes a difference to the truth of the formula. The truth of \fy x = y 
depends on the structure in which it is interpreted — it's true if the 
universe has only one element and false otherwise — but \/yy — y is 
a tautology by Problem 6.12 so it is true in any structure whatsoever. 
This sort of difficulty makes it necessary to be careful when substituting 
for variables. 



41 



42 



7. DEDUCTIONS 



Definition 7.2. Suppose x is a variable, t is a term, and (p is 
a formula. If t is substitutable for x in (p, then (p^ {i.e. (p with t 
substituted for x) is defined as follows: 

(1) li (p is atomic, then ip^ is the formula obtained by replacing 
each occurrence of a; in by t. 

(2) If (/9 is then (pf is the formula {-iipf). 

(3) If is (a — >• then is the formula {a^ — >• 

(4) If (/? is Vy5, then c/?^ is the formula 

(a) \/yS if X is y, and 

(b) 5^ if X isn't y. 

Problem 7.1. (1) Is x substitutable for z in ip if ijj is z — 
X ^ \/z z = X? If so, what is ipl ? 

(2) Show that if t is any term and a is a sentence, then t is sub- 
stitutable in a for any variable x. What is erf? 

(3) Show that if t is a term in which no variable occurs that occurs 
in the formula (p, then t is substitutable in (p for any variable 

X. 

(4) Show that x is substitutable for x in ip for any variable x and 
any formula (p, and that (p^ is just (p. 

Along with the notion of substitutability, we need an additional 
notion in order to define the logical axioms of 

Definition 7.3. If (p is any formula and xi, . . . , Xn are any vari- 
ables, then \/xi . . . \/xn ^p is said to be a generalization of ip. 

For example, WyWx {x = y ^ fx = fy) and \/z {x = y ^ fx = fy) 
are (different) generalizations oix = y^fx = fy., but Vx 3y [x = 
y ^ fx = fy) is not. Note that the variables being quantified don't 
have to occur in the formula being generalized. 

Lemma 7.2. Any generalization of a tautology is a tautology. 

Definition 7.4. Every first-order language C has eight logical ax- 
iom schema: 

Al: {a ^{(5^ a)) 

A2: {{a ^{(5^ 7)) - ((« ^ P) ^ {a ^ 7))) 
A3: ^ (-a)) ^ ^ a) ^ /?))_ 

A4: (Vxa — a^), if t is substitutable for x in a. 

A5: (Vx (a /?) ^ (Vx a Vx {3)) 

A6: (a ^ Vxa), if x docs not occur free in a. 

A7: X — X 
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A8: (a; = 1/ — s> (a — >• if a is atomic and P is obtained from 
a by replacing some occurrences (possibly all or none) of x in 
a by y. 

Plugging in any particular formulas of £ for a, P, and 7, and any 
particular variables for x and y, in any of A1-A8 gives a logical axiom 
of C In addition, any generalization of a logical axiom of £ is also a 
logical axiom of JC. 

The reason for calling the instances of A1-A8 the logical axioms, 
instead of just axioms, is to avoid conflict with Deflnition 6.8. 

Problem 7.3. Determine whether or not each of the following for- 
mulas is a logical axiom. 

(1) \/xyz{x — y^{x — c^x — y)) 

(2) x = y^{y^z^z^x) 

(3) \/z {x = y ^ {x = c —>■ y = c)) 

(4) \/w 3x {Pwx — >• Pww) — >• 3x {Pxx — >• Pxx) 

(5) Vx (Vx c = fxc — > Vx Vx c = /xc) 

(6) (3x ^ 3y Rzfy) {{3x Px Vy -V^ ^ Vx ^Px) 

Proposition 7.4. Every logical axiom is a tautology. 

Note that we have recycled our axiom schemas Al — A3 from propo- 
sitional logic. We will also recycle MP as the sole rule of inference for 
flrst-order logic. 

Definition 7.5 (Modus Ponens). Given the formulas ip and ((^ — > 
■0), one may infer ■0. 

As in propositional logic, we will usually refer to Modus Ponens by 
its initials, MP. That MP preserves truth in the sense of Chapter 6 
follows from Problem 6.10. Using the logical axioms and MP, we can 
execute deductions in first-order logic just as we did in propositional 
logic. 

Definition 7.6. Let A be a set of formulas of the first-order lan- 
guage C A deduction or proof from A in £ is a finite sequence 
(pi(p2 ■ ■ ■ (fin of formulas of C such that for each k < n, 

(1) (fk is a. logical axiom, or 

(2) G A, or 

(3) there are i,j < k such that follows from ipi and Lpj by MP. 

A formula of A appearing in the deduction is usually referred to as a 
premiss of the deduction. A proves a formula a, written as A h a, if 
a is the last formula of a deduction from A. We'll usually write h a 
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instead of h a. Finally, if T and A are sets of formulas, we'll take 
r h A to mean that F h (5 for every formula 5 e A. 

Note. We have reused the axiom schema, the rule of inference, and 
the definition of deduction from propositional logic. It follows that any 
deduction of propositional logic can be converted into a deduction of 
first-order logic simply by replacing the formulas of Cp occurring in 
the deduction by first-order formulas. Feel free to appeal to the deduc- 
tions in the exercises and problems of Chapter 3. You should probably 
review the Examples and Problems of Chapter 3 before going on, since 
most of the rest of this Chapter concentrates on what is different about 
deductions in first-order logic. 

Example 7.1. We'll show that {a} \- 3xa for any first-order for- 



mula a and any variable x. 

(1) (Vx-iQ! —>■ -la) — >• (a — >• -iVx-ia) Problem 3.9.5 

(2) \/x -la -la A4 

(3) a^^yx^a 1,2 MP 

(4) a Premiss 

(5) -nWx 3,4 MP 

(6) 3xa Definition of 3 



Strictly speaking, the last line is just for our convenience, like 3 itself. 

Problem 7.5. Show that: 

(1) \-\/xip ^ ^^y^y) if y does not occur at all in ip. 

(2) h a V -.a. 

(3) {c — d}\- \/z Qazc — > Qazd. 

(4) \-x = y^y = x. 

(5) {3xa} \- a if X does not occur free in a. 

Many general facts about deductions can be recycled from propo- 
sitional logic, including the Deduction Theorem. 

Proposition 7.6. If ipiip2 . . .ipn is a deduction of C, then ipi . . .ip^ 
is also a deduction of C for any t such that 1 < i < n. 

Proposition 7.7. Ifr\-Sandr\-S^p, then F h /3. 

Proposition 7.8. //F C A and r\- a, then Aha. 

Proposition 7.9. Then z/F h A and A\- a, then T \- a. 

Theorem 7.10 (Deduction Theorem). //S is any set of formulas 
and a and {3 are any formulas, then S h a — > /3 if and only i/SU{Q;} h 
(5. 
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Just as in propositional logic, the Deduction Theorem is useful be- 
cause it often lets us take shortcuts when trying to show that deductions 
exist. There is also another result about first-order deductions which 
often supplies useful shortcuts. 

Theorem 7.11 (Generalization Theorem). Suppose x is a variable, 
r is a set of formulas in which x does not occur free, and (p is a formula 
such that r h Then T \-\/xLp. 

Theorem 7. 12 (Generalization On Gonstants) . Suppose that c is a 
constant symbol, F is a set of formulas in which c does not occur, and 
(f is a formula such that The/?. Then there is a variable x which does 
not occur in ip such that T h Va;<^^.^ Moreover, there is a deduction of 
\/x ip'^^ from r in which c does not occur. 

Example 7.2. We'll show that if ip and if) are any formulas, x is 
any variable, and \- ip ip, then \-\/xp> ^ Wxip. 

Since x does not occur free in any formula of 0, it follows from 
\- (p ^ jphy the Generalization Theorem that h Vx (</?—> ■0) . But then 

(1) Vx {ip — > ijj) above 

(2) \/x {ip ^ Ip) —>■ (Vx ip ^yxip) A5 

(3) ^xip^^xip 1,2 MP 

is the tail end of a deduction of \/xip -^\/xip from 0. 

Problem 7.13. Show that: 

(1) \- 'ixyy'i z {x = y ^ {y = z ^ X = z)). 

(2) \-\/xa ^ 3x a. 

(3) h 3x7 — > Vx7 ^/^ does not occur free in 7. 
We conclude with a bit of terminology. 

Definition 7.7. If E is a set of sentences, then the theory of E is 
Th(E) = { r I r is a sentence and E h r }. 

That is, the theory of E is just the collection of all sentences which 
can be proved from E. 



^ip% is if with every occurence of the constant c replaced by x. 
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Soundness and Completeness 

As with prepositional logic, first-order logic had better satisfy the 
Soundness Theorem and it is desirable that it satisfy the Completeness 
Theorem. These theorems do hold for first-order logic. The Soundness 
Theorem is proved in a way similar to its counterpart for propositional 
logic, but the Completeness Theorem will require a fair bit of additional 
work.^ It is in this extra work that the distinction between formulas 
and sentences becomes useful. 

Let £ be a fixed countable first-order language throughout this 
chapter. All formulas will be assumed to be formulas of L unless stated 
otherwise. 

First, we rehash many of the definitions and facts we proved for 
propositional logic in Chapter 4 for first-order logic. 

Theorem 8.1 (Soundness Theorem). Ij a is a sentence and A is 
a set of sentences such that Aha, then A a. 

Definition 8.1. A set of sentences F is inconsistent if F I — i{il> — >■ 
ip) for some formula ip, and is consistent if it is not inconsistent. 

Recall that a set of sentences F is satisfiable if 9Jt ^ F for some 
structure Tt. 

Proposition 8.2. // a set of sentences F is satisfiable, then it is 
consistent. 

Proposition 8.3. Suppose A is an inconsistent set of sentences. 
Then Ah i/j for any formula ijj. 

Proposition 8.4. Suppose E is an inconsistent set of sentences. 
Then there is a finite subset A o/ E such that A is inconsistent. 

Corollary 8.5. A set of sentences F is consistent if and only if 
every finite subset of F is consistent. 



This is not too surprising because of the greater complexity of first-order logic. 
Also, it turns out that first-order logic is about as powerful as a logic can get and 
still have the Completeness Theorem hold. 
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Definition 8.2. A set of sentences S is maximally consistent if S 
is consistent but E U {r} is inconsistent whenever r is a sentence such 
that T ^ E. 

One quick way of finding examples of maximally consistent sets is 
given by the following proposition. 

Proposition 8.6. IfOSi is a structure, then Th{Wl) is a maximally 
consistent set of sentences. 

Example 8.1. SJl = ({5}) is a structure for £=, so Th(aJl) is a 

maximally consistent set of sentences. Since it turns out that Th(9Jt) = 
Th ({ Vx Vya; = y}), this also gives us an example of a set of sentences 
E = {yxMyx = y} such that Th(S) is maximally consistent. 

Proposition 8.7. //S is a maximally consistent set of sentences, 
T is a sentence, and S h r, then r G E. 

Proposition 8.8. Suppose H is a maximally consistent set of sen- 
tences and T is a sentence. Then -tr & T, if and only if t ^ Tj. 

Proposition 8.9. Suppose H is a maximally consistent set of sen- 
tences and tp and ip are any sentences. Then ip ^ ip G 12 if and only if 
ip ^ or ip & Tj. 

Theorem 8.10. Suppose T is a consistent set of sentences. Then 
there is a maximally consistent set of sentences E with F C E. 

The counterparts of these notions and facts for propositional logic 
sufficed to prove the Completeness Theorem, but here we will need 
some additional tools. The basic problem is that instead of defining a 
suitable truth assignment from a maximally consistent set of formulas, 
we need to construct a suitable structure from a maximally consistent 
set of sentences. Unfortunately, structures for first-order languages are 
usually more complex than truth assignments for propositional logic. 
The following definition supplies the key new idea we will use to prove 
the Completeness Theorem. 

Definition 8.3. Suppose E is a set of sentences and C is a set of 
(some of the) constant symbols of C Then C is a set of witnesses for 
E in if for every formula (p oi £ with at most one free variable x, 
there is a constant symbol c e C such that E h 3xv? ^ p^. 

The idea is that every element of the universe which E proves must 
exist is named, or "witnessed", by a constant symbol in C. Note that 
if E I — 3x(p, then E h 3x </? — > </7^ for any constant symbol c. 
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Proposition 8.11. Suppose T and S are sets of sentences of C, 
r C E, and C is a set of witnesses for F in C Then C is a set of 
witnesses for in C 

Example 8.2. Let C'q be the first-order language with a single 2- 
place relation symbol, <, and countably many constant symbols, Cq for 
each g G Q. Let S include all the sentences 

(1) Cp < Cq, for every p, g e Q such that p < q, 

(2) ^x{^x < x), 

(3) \/x\/y{x < y\/ X — y\/ y < x), 

(4) \/x\/y\/z {x<y^{y<z^x< z)), 

(5) Mx'iy {x < y ^ 3z [x < z f\ z < y)), 

(6) 3y {x < y), and 

(7) yx3y{y < x). 

In effect, E asserts that < is a linear order on the universe (2-4) which 
is dense (5) and has no endpoints (6-7), and which has a suborder 
isomorphic to Q (1). Then C — {cq | g e Q } is a set of witnesses for 
E in Cq. 

In the example above, one can "reverse-engineer" a model for the set 
of sentences in question from the set of witnesses simply by letting the 
universe of the structure be the set of witnesses. One can also define the 
necessary relation interpreting < in a pretty obvious way from E.^ This 
example is obviously contrived: there are no constant symbols around 
which are not witnesses, S proves that distinct constant symbols aren't 
equal to to each other, there is little by way of non-logical symbols 
needing interpretation, and E explicitly includes everything we need to 
know about <. 

In general, trying to build a model for a set of sentences S in this 
way runs into a number of problems. First, how do we know whether 
E has a set of witnesses at all? Many first-order languages have few or 
no constant symbols, after all. Second, if E has a set of witnesses C, 
it's unlikely that we'll be able to get away with just letting the universe 
of the model be C. What if S h c = for some distinct witnesses c 
and Third, how do we handle interpreting constant symbols which 
are not in C? Fourth, what if E doesn't prove enough about whatever 
relation and function symbols exist to let us define interpretations of 
them in the structure under construction? (Imagine, if you like, that 
someone hands you a copy of Joyce's Ulysses and asks you to produce a 

^Note, however, that an isomorphic copy of Q is not the only structure for Cq 
satisfying E. For example, 9^ = (M, <, q + n: g € Q) will also satisfy E if we intepret 
Cq hy q + n. 
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complete road map of Dublin on the basis of the book. Even if it has no 
geographic contradictions, you are unlikely to find all the information 
in the novel needed to do the job.) Finally, even if E docs prove all we 
need to define functions and relations on the universe to interpret the 
function and relation symbols, just how do we do it? Getting around 
all these difficulties requires a fair bit of work. One can get around 
many by sticking to maximally consistent sets of sentences in suitable 
languages. 

Lemma 8.12. Suppose S is a set of sentences, ip is any formula, 
and X is any variable. Then (p if and only if 12 \-\/x ip. 

Theorem 8.13. SupposeV is a consistent set of sentences of C Let 
C he an infinite countable set of constant symbols which are not symbols 
of C, and let C = CUC be the language obtained by adding the constant 
symbols in C to the symbols of C. Then there is a maximally consistent 
set E of sentences of CJ such that F C E and C is a set of witnesses 
for E. 

This theorem allows one to use a certain measure of brute force: 
No set of witnesses? Just add one! The set of sentences doesn't decide 
enough? Decide everything one way or the other! 

Theorem 8.14. Suppose H is a maximally consistent set of sen- 
tences and C is a set of witnesses for E. Then there is a structure IXfl 
such that 3Jl 1= E. 

The important part here is to define 3Jl — proving that 2Jl |= E 
is tedious but fairly straightforward if you have the right definition. 
Proposition 6.17 now lets us deduce the fact we really need. 

Corollary 8.15. Suppose T is a consistent set of sentences of a 
first-order language C. Then there is a structure 9Jt for C satisfying F. 

With the above facts in hand, we can rejoin our proof of Soundness 

and Completeness, already in progress: 

Theorem 8.16. A set of sentences in C is consistent if and only 

if it is satisfiable. 

The rest works just like it did for propositional logic. 

Theorem 8.17 (Completeness Theorem). If a is a sentence and A 

is a set of sentences such that A |= a, then Aha. 

It follows that in a first-order logic, as in propositional logic, a 
sentence is implied by some set of premisses if and only if it has a proof 
from those premisses. 
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Theorem 8.18 (Compactness Theorem). A set of sentences A 
satisfiable if and only if every finite subset of A is satisfiable. 



CHAPTER 9 



Applications of Compactness 

After wading through the preceding chapters, it should be obvious 
that first-order logic is, in principle, adequate for the job it was origi- 
nally developed for: the essentially philosophical exercise of formalizing 
most of mathematics. As something of a bonus, first-order logic can 
supply useful tools for doing "real" mathematics. The Compactness 
Theorem is the simplest of these tools and glimpses of two ways of 
using it are provided below. 

From the finite to the infinite. Perhaps the simplest use of the 
Compactness Theorem is to show that if there exist arbitrarily large 
finite objects of some type, then there must also be an infinite object 
of this type. 

Example 9.1. We will use the Compactness Theorem to show that 
there is an infinite commutative group in which every element is of order 
2, i.e. such that g • g — e for every element g. 

Let jCg be the first-order language with just two non-logical sym- 
bols: 

• Constant symbol: e 

• 2-place function symbol: ■ 

Here e is intended to name the group's identity element and • the group 
operation. Let E be the set of sentences of jCq including: 

(1) The axioms for a commutative group: 

• \/x X ■ e — X 

• Wx3y X ■ y = e 

• WxWy\/zx ■ {y ■ z) = {x ■ y) ■ z 

• \/xyyy-x — x-y 

(2) A sentence which asserts that every element of the universe is 
of order 2: 

• \fx X ■ X = e 

(3) For each n > 2, a sentence, cr^, which asserts that there are at 
least n different elements in the universe: 

• 3xi . ..3Xn{{^Xi = X2)A(-iXi = X3)A- • • A(-iX„_i = Xn)) 
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We claim that every finite subset of E is satisfiable. The most 
direct way to verify this is to show how, given a finite subset A of E, 
to produce a model OJt of A. Let n be the largest integer such that 
cr„ G A U {o"2} (Why is there such an n?) and choose an integer k such 
that 2^ > n. Define a structure (G, o) for Cq as follows: 

• G = { (a^ I 1 < £ < A;) I a£ = or 1 } 

• {at\l<^<k)o {hi\l<l<k) = {a^ + hi (mod 2) | 1 < ^ < 
k) 

That is, G is the set of binary sequences of length k and o is coordi- 
natewise addition modulo 2 of these sequences. It is easy to check that 
(G, o) is a commutative group with 2^ elements in which every element 
has order 2. Hence (G, o) |= A, so A is satisfiable. 

Since every finite subset of E is satisfiable, it follows by the Com- 
pactness Theorem that E is satisfiable. A model of E, however, must 
be an infinite commutative group in which every clement is of order 
2. (To be sure, it is quite easy to build such a group directly; for ex- 
ample, by using coordinatewise addition modulo 2 of infinite binary 
sequences.) 

Problem 9.1. Use the Compactness Theorem to show that there is 
an infinite 

(1) bipartite graph, 

(2) non-commutative group, and 

(3) field of characteristic 3, 

and also give concrete examples of such objects. 

Most applications of this method, including the ones above, are 
not really interesting: it is usually more valuable, and often easier, to 
directly construct examples of the infinite objects in question rather 
than just show such must exist. Sometimes, though, the technique 
can be used to obtain a non-trivial result more easily than by direct 
methods. We'll use it to prove an important result from graph theory, 
Ramsey's Theorem. Some definitions first: 

Definition 9.1. If X is a set, let the set of unordered pairs of 
elements of X be [X^ = { {a, 6} | a, 6 e X and a ^ b}. (See Defini- 
tion A.l.) 

(1) A graph is a pair (V, E) such that K is a non-empty set and 
E C [y]^. Elements of V are called vertices of the graph and 
elements of E are called edges. 

(2) A subgraph of (V, E) is a pair (t/, F), where U dV and F ~ 
Er\\Uf. 
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(3) A subgraph {U, F) of {V, E) is a clique if F = [t/]^. 

(4) A subgraph ([/, F) of (V, E) is an independent set \i F — 

That is, a graph is some collection of vertices, some of which are 
joined to one another. A subgraph is just a subset of the vertices, 
together with all edges joining vertices of this subset in the whole graph. 
It is a clique if it happens that the original graph joined every vertex in 
the subgraph to all other vertices in the subgraph, and an independent 
set if it happens that the original graph joined none of the vertices in 
the subgraph to each other. The question of when a graph must have 
a clique or independent set of a given size is of some interest in many 
applications, especially in dealing with colouring problems. 

Theorem 9.2 (Ramsey's Theorem). For every n > 1 there is an 
integer Rn such that any graph with at least Rn vertices has a clique 
with n vertices or an independent set with n vertices. 

Rn is the nth Ramsey number. It is easy to see that Ri — 1 and 
i?2 = 2, but -R3 is already 6, and i?„ grows very quickly as a function 
of n thereafter. Ramsey's Theorem is fairly hard to prove directly, but 
the corresponding result for infinite graphs is comparatively straight- 
forward. 

Lemma 9.3. // (V, E) is a graph with infinitely many vertices, then 
it has an infinite clique or an infinite independent set. 

A relatively quick way to prove Ramsey's Theorem is to first prove 
its infinite counterpart. Lemma 9.3, and then get Ramsey's Theorem 
out of it by way of the Compactness Theorem. (If you're an ambitious 
minimalist, you can try to do this using the Compactness Theorem for 
propositional logic instead!) 

Elementary equivalence and non-standard models. One of 

the common uses for the Compactness Theorem is to construct "non- 
standard" models of the theories satisfied by various standard math- 
ematical structures. Such a model satisfies all the same first-order 
sentences as the standard model, but differs from it in some way not 
expressible in the first-order language in question. This brings home 
one of the intrinsic limitations of first-order logic: it can't always tell 
essentially different structures apart. Of course, we need to define just 
what constitutes essential difference. 

Definition 9.2. Suppose £ is a first-order language and 91 and 9Jt 
arc two structures for C. Then Dl and 0?t are: 

(1) isomorphic, written as Dl = 9Jt, if there is a function F: |Oft| ^ 
such that 
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(a) F is 1 — 1 and onto, 

(b) F(c^) — for every constant symbol c of C, 

(c) F(/^(ai, ...,ak) = f^iFiai), F{ak)) for every A:-place 
function symbol f of C and elements Oi, . . . , G and 

(d) P'"(ai, . . . , Ofc) holds if and only if P'"(F(ai), . . . , F{ak)) 
for every fc-place relation symbol of C and elements oi, 
■ ■ ■ , Ofe of |9T|; 

and 

(2) elementarily equivalent, written as = 9Jt, if Th(D'I) = Th(9Jl), 
i.e. ii^\= a if and only if 9Jt |= cr for every sentence a of C 

That is, two structures for a given language are isomorphic if they 
are structurally identical and elementarily equivalent if no statement 
in the language can distinguish between them. Isomorphic structures 
are elementarily equivalent: 

Proposition 9.4. Suppose C is a first-order language and and 
Wl are structures for L such that 01 = Then 91 = 

However, as the following application of the Compactness Theorem 
shows, elementarily equivalent structures need not be isomorphic: 

Example 9.2. Note that € = (N) is an infinite structure for C=. 
Expand C= to Cr by adding a constant symbol for every real number 
r, and let S be the set of sentences of C= including 

• every sentence r of Th(C), i.e. such that C |= r, and 

• ->Cr — Cs for every pair of real numbers r and s such that r ^ s. 

Every finite subset of E is satisfiable. (Why?) Thus, by the Compact- 
ness Theorem, there is a structure U' for Cr satisfying E, and hence 
Th(C). The structure 11 obtained by dropping the interpretations of 
all the constant symbols c,. from il' is then a structure for C= which 
satisfies Th(£). Note that |it| = is at least large as the set of all 
real numbers R, since il' requires a distinct element of the universe to 
interpret each constant symbol of Cr. 

Since Th(£) is a maximally consistent set of sentences of C= by 
Problem 8.6, it follows from the above that = il. On the other hand, 
£ cannot be isomorphic to il because there cannot be an onto map 
between a countable set, such as N = and a set which is at least 
as large as R, such as |il| . 

In general, the method used above can be used to show that if a 
set of sentences in a first-order language has an infinite model, it has 
many different ones. In C= that is essentially all that can happen: 
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Proposition 9.5. Two structures for C= are elementarily equiva- 
lent if and only if they are isomorphic or infinite. 

Problem 9.6. Lef^^ (N, 0, 1, S, +, ■, E) he the standard structure 
for Cn- Use the Compactness Theorem to show there is a structure 3Jl 
for Cn such that ^ = ^but not 01 = 9Jl. 

Note that because and 9Jt both satisfy Th(OT) , which is maximally 
consistent by Problem 8.6, there is absolutely no way of telling them 
apart in C^. 

Proposition 9.7. Every model o/Th(OI) which is not isomorphic 
to or has 

(1) an isomorphic copy o/OI embedded in it, 

(2) an infinite number, i.e. one larger than all of those in the copy 
o/9T, and 

(3) an infinite decreasing sequence. 

The apparent limitation of first-order logic that non-isomorphic 
structures may be elementarily equivalent can actually be useful. A 
non-standard model may have features that make it easier to work 
with than the standard model one is really interested in. Since both 
structures satisfy exactly the same sentences, if one uses these features 
to prove that some statement expressible in the given first-order lan- 
guage is true about the non-standard structure, one gets for free that 
it must be true of the standard structure as well. A prime example of 
this idea is the use of non-standard models of the real numbers con- 
taining infinitesimals (numbers which are infinitely small but different 
from zero) in some areas of analysis. 

Theorem 9.8. Let y\ = (M, 0, 1, +, •) be the field of real numbers, 
considered as a structure for C f ■ Then there is a model of Thi^Jl) which 
contains a copy of M and in which there is an infinitesimal. 

The non-standard models of the real numbers actually used in anal- 
ysis are usually obtained in more sophisticated ways in order to have 
more information about their internal structure. It is interesting to 

note that infinitesimals were the intuition behind calculus for Leibniz 
when it was first invented, but no one was able to put their use on a 
rigourous footing until Abraham Robinson did so in 1950. 
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Hints for Chapter 5. 

5.1. Try to disassemble each string using Definition 5.2. Note that 
some might be vahd terms of more than one of the given languages. 

5.2. This is similar to Problem 1.5. 

5.3. This is similar to Proposition 1.7. 

5.4. Try to disassemble each string using Definitions 5.2 and 5.3. 
Note that some might be valid formulas of more than one of the given 
languages. 

5.5. This is just like Problem 1.2. 

5.6. This is similar to Problem 1.5. You may wish to use your 
solution to Problem 5.2. 

5.7. This is similar to Proposition 1.7. 

5.8. You might want to rephrase some of the given statements to 
make them easier to formalize. 

(1) Look up associativity if you need to. 

(2) "There is an object such that every object is not in it." 

(3) This should be easy. 

(4) Ditto. 

(5) "Any two things must be the same thing." 

5.9. If necessary, don't hesitate to look up the definitions of the 
given structures. 

(1) Read the discussion at the beginning of the chapter. 

(2) You really need only one non-logical symbol. 

(3) There are two sorts of objects in a vector space, the vectors 
themselves and the scalars of the field, which you need to be 
able to tell apart. 

5.10. Use Definition 5.3 in the same way that Definition 1.2 was 
used in Definition 1.3. 
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5.11. The scope of a quantifier ouglit to be a certain subformula of 
the formula in which the quantifier occurs. 

5.12. Check to see whether they satisfy Definition 5.4. 

5.13. Check to see which pairs satisfy Definition 5.5. 

5.14. Proceed by induction on the length of using Definition 5.3. 

5.15. This is similar to Theorem 1.12. 

5.16. This is similar to Theorem 1.12 and uses Theorem 5.15. 
Hints for Chapter 6. 

6.1. In each case, apply Definition 6.1. 

(1) This should be easy. 

(2) Ditto. 

(3) Invent objects which are completely different except that they 
happen to have the right number of the right kind of compo- 
nents. 

6.2. Figure out the relevant values of s(fn) and apply Definition 

6.3. 

6.3. Suppose s and r both extend the assignment s. Show that 
s(i) = r(i) by induction on the length of the term t. 

6.4. Unwind the formulas using Definition 6.4 to get informal state- 
ments whose truth you can determine. 

6.5. Unwind the abbreviation 3 and use Definition 6.4. 

6.6. Unwind each of the formulas using Definitions 6.4 and 6.5 to 
get informal statements whose truth you can determine. 

6.7. This is much like Proposition 6.3. 

6.8. Proceed by induction on the length of the formula using Defi- 
nition 6.4 and Lemma 6.7. 

6.9. How many free variables does a sentence have? 

6.10. Use Definition 6.4. 

6.12. Unwind the sentences in question using Definition 6.4. 

6.11. Use Definitions 6.4 and 6.5; the proof is similar in form to 
the proof of Proposition 2.9. 

6.14. Use Definitions 6.4 and 6.5; the proof is similar in form to 
the proof for Problem 2.10. 
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6.15. Use Definitions 6.4 and 6.5 in each case, plus tlie meanings 
of our abbreviations. 

6.17. In one direction, you need to add appropriate objects to a 
structure; in the other, delete them. In both cases, you still have to 
verify that F is still satisfied. 

6.18. Here are some appropriate languages. 

(1) £= 

(2) Modify your language for graph theory from Problem 5.9 by 
adding a 1-place relation symbol. 

(3) Use your language for group theory from Problem 5.9. 
(4) 

Hints for Chapter 7. 

7.1. (1) Use Definition 7.1. 

(2) Ditto. 

(3) Ditto. 

(4) Proceed by induction on the length of the formula </?. 

7.2. Use the definitions and facts about |= from Chapter 6. 

7.3. Check each case against the schema in Definition 7.4. Don't 
forget that any generalization of a logical axiom is also a logical axiom. 

7.4. You need to show that any instance of the schemas A1-A8 is 
a tautology and then apply Lemma 7.2. That each instance of schemas 
A1-A3 is a tautology follows from Proposition 6.15. For A4-A8 you'll 
have to use the definitions and facts about |= from Chapter 6. 

7.5. You may wish to appeal to the deductions that you made or 
were given in Chapter 3. 

(1) Try using A4 and A6. 

(2) You don't need A4-A8 here. 

(3) Try using A4 and A8. 

(4) A8 is the key; you may need it more than once. 

(5) This is just A6 in disguise. 

7.6. This is just like its counterpart for propositional logic. 

7.7. Ditto. 

7.8. Ditto. 

7.9. Ditto. 



7.10. Ditto. 
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7.11. Proceed by induction on the length of the shortest proof of 
(/? from r. 

7.12. Ditto. 

7.13. As usual, don't take the following suggestions as gospel. 

(1) Try using A8. 

(2) Start with Example 7.1. 

(3) Start with part of Problem 7.5. 

Hints for Chapter 8. 

8.1. This is similar to the proof of the Soundness Theorem for 
propositional logic, using Proposition 6.10 in place of Proposition 3.2. 

8.2. This is similar to its counterpart for prpositional logic. Propo- 
sition 4.2. Use Proposition 6.10 instead of Proposition 3.2. 

8.3. This is just like its counterpart for propositional logic. 

8.4. Ditto. 

8.5. Ditto. 

8.6. This is a counterpart to Problem 4.6; use Proposition 8.2 in- 
stead of Proposition 4.2 and Proposition 6.15 instead of Proposition 
2.4. 

8.7. This is just like its counterpart for propositional logic. 

8.8. Ditto 

8.9. Ditto. 

8.10. This is much like its counterpart for propositional logic. The- 
orem 4.10. 

8.11. Use Proposition 7.8. 

8.12. Use the Generalization Theorem for the hard direction. 

8.13. This is essentially a souped-up version of Theorem 8.10. To 
ensure that C is a set of witnesses of the maximally consistent set of 
sentences, enumerate all the formulas </? of CJ with one free variable 
and take care of one at each step in the inductive construction. 
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8.14. To construct the required structure, 9Jt, proceed as follows. 
Define an equivalence relation ~ on C by setting c ~ d if and only if 
c = d E Tj, and let [c] = { a G C | a ~ c } be the equivalence class of 
c G C. The universe of 07t will be M = { [c] | c G C }. For each /c-place 
function symbol / define /™ by setting /™([ai], . . . , [ak]) = [b] if and 
only if /oi . . . afe = 6 is in E. Define the interpretations of constant 
symbols and relation symbols in a similar way. You need to show that 
all these things are well-defined, and then show that 9Jl |= E. 

8.15. Expand F to a maximally consistent set of sentences with a 
set of witnesses in a suitable extension of C, apply Theorem 8.14, and 
then cut down the resulting structure to one for C 

8.16. One direction is just Proposition 8.2. For the other, use 
Corollary 8.15. 

8.17. This follows from Theorem 8.16 in the same way that the 
Completeness Theorem for propositional logic followed from Theorem 
4.11. 

8.18. This follows from Theorem 8.16 in the same way that the 
Compactness Theorem for propositional logic followed from Theorem 
4.11. 

Hints for Chapter 9. 

9.1. In each case, apply the trick used in Example 9.1. For defi- 
nitions and the concrete examples, consult texts on combinatorics and 
abstract algebra. 

9.2. Suppose Ramsey's Theorem fails for some n. Use the Com- 
pactness Theorem to get a contradiction to Lemma 9.3 by showing 
there must be an infnite graph with no clique or independent set of 
size n. 

9.3. Inductively define a sequence ao, ai, . . . , of vertices so that for 
every n, either it is the case that for all A; > n there is an edge joining 

an to flfc or it is the case that for all A; > n there is no edge joining a„ 
to Ofc. There will then be a subsequence of the sequence which is an 
infinite clique or a subsequence which is an infinite independent set. 

9.4. The key is to figure out how, given an assignment for one 
structure, one should define the corresponding assignment in the other 
structure. After that, proceed by induction using the definition of 
satisfaction. 

9.5. When are two finite structures for C= elementarily equivalent? 
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9.6. In a suitable expanded language, consider Th(Ol) together with 
the sentences 3x0 + x — c, 3x SO + x — c, 3x SSO + x — c, ... 

9.7. Suppose DJt \= Th(D^) but is not isomorphic to 01. 

(1) Consider the subset of |2Jt| given by 0™, S^{0^), S^{S^{0^)), 

(2) If it didn't have one, it would be a copy of 91. 

(3) Start with a infinite number and work down. 

9.8. Expand Cp hj throwing in a constant symbol for every real 
number, plus an extra one, and take it from there. 



Part III 
Computability 



CHAPTER 10 



Turing Machines 

Of the various ways to formalize the notion an "effective method", 
the most commonly used are the simple abstract computers called Tur- 
ing machines, which were introduced more or less simultaneously by 
Alan Turing and Emil Post in 1936.^ Like most real-life digital com- 
puters, Turing machines have two main parts, a processing unit and 
a memory (which doubles as the input/output device), which we will 
consider separately before seeing how they interact. The memory can 
be thought of as an infinite tape which is divided up into cells like the 
frames of a movie. The Turing machine proper is the processing unit. 
It has a scanner or head which can read from or write to a single cell 
of the tape, and which can be moved to the left or right one cell at a 
time. 

Tapes. To keep things simple, in this chapter we will only allow 
Turing machines to read and write the symbols and 1. (One symbol 
per cell!) Moreover, we will allow the tape to be infinite in only one 
direction. That these restrictions do not affect what a Turing machine 
can, in principle, compute follows from the results in the next chapter. 

Definition 10.1. A tape is an infinite sequence 

a = ao fli 02 . . . 

such that for each integer i the cell ai G {0, 1}. The ith cell is said to 
be blank if is 0, and marked if is 1. 

A blank tape is one in which every cell is 0. 

Example 10.1. A blank tape looks like: 

000000000000000000000000 • • • 

The 0th cell is the leftmost one, cell 1 is the one immediately to the 
right, cell 2 is the one immediately to the right of cell 1, and so on. 
The following is a slightly more exciting tape: 

0101101110001000000000000000- • • 

^Both papers are reprinted in [6]. Post's brief paper gives a particularly lucid 
informal description. 
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In this case, cell 1 is marked {i.e. contains a 1), as do cells 3, 4, 5, 7, 
8, and 12; all the rest are blank {i.e. contain a 0). 

Problem 10.1. Write down tapes satisfying the following. 

(1) Entirely blank except for cells 3, 12, and 20. 

(2) Entirely marked except for cells 0, 2, and 3. 

(3) Entirely blank except that 1025 is written out in binary just to 

the right of cell 2. 

To keep track of which cell the Turing machine's scanner is at, plus 
which instruction the Turing machine is to execute next, we will usually 
attach additional information to our description of the tape. 

Definition 10.2. A tape position is a triple (s, i, a), where s and i 
are natural numbers with s > 0, and a is a tape. Given a tape position 
(s, i, a), we will refer to cell i as the scanned cell and to s as the state. 

Note that if (s, i, a) is a tape position, then the corresponding Tur- 
ing machine's scanner is presently reading (which is one of or 1). 

Conventions for tapes. Unless stated otherwise, we will assume 
that all but finitely many cells of any given tape are blank, and that any 
cells not explicitly described or displayed are blank. We will usually 
depict as little of a tape as possible and omit the • • • s we used above. 
Thus 

0101101110001 

represents the tape given in the Example 10.1. In many cases we will 
also use 2;" to abbreviate n consecutive copies of so the same tape 
could be represented by 

0101^01^0^1. 

Similarly, if cr is a finite sequence of elements of (0, 1}, we may write cr" 
for the sequence consisting of n copies of a stuck together end-to-end. 
For example, (010)=^ is short for 010010010. 

In displaying tape positions wc will usually underline the scanned 
cell and write s to the left of the tape. For example, we would display 
the tape position using the tape from Example 10.1 with cell 3 being 
scanned and state 2 as follows: 

2: 0101101110001 

Note that in this example, the scanner is reading a 1. 

Problem 10.2. Using the tapes you gave in the corresponding part 
of Problem 10.1, write down tape positions satisfying the following con- 
ditions. 
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(1) Cell 7 being scanned and state 4. 

(2) Cell 4 being scanned and state 3. 

(3) Cell 3 being scanned and state 413. 

Turing machines. The "processing unit" of a Turing machine is 
just a finite hst of specifications describing what the machine wiU do in 
various situations. (Remember, this is an abstract computer. . . ) The 
formal definition may not seem to amount to this at first glance. 

Definition 10.3. A Turing machine is a function M such that for 
some natural number n, 

dom(M) C {l,...,n} X {0, 1} 

= { (s, 6) I 1 < s < n and 6 e {0, 1} } 

and 

ran(M) C {0, 1} x {-1, 1} x {1, . . . , n} 

= {{c,d,t)\ce {0, 1} and d e {-1, 1} and 1 < i < n } . 

Note that M does not have to be defined for all possible pairs 

{s,b)e{l,...,n}x{0,l}. 

We will sometimes refer to a Turing machine simply as a machine 
or TM . If n > 1 is least such that M satisfies the definition above, we 
shall say that M is an n-state Turing machine and that {1, . . . ,n} is 
the set of states of M. 

Intuitively, we have a processing unit which has a finite list of basic 
instructions, the states, which it can execute. Given a combination of 
current state and the symbol marked in the currently scanned cell of 
the tape, the list specifies 

• a symbol to be written in the currently scanned cell, overwrit- 
ing the symbol being read, then 

• a move of the scanner one cell to the left or right, and then 

• the next instruction to be executed. 

That is, M(s, c) = (6, d, t) means that if our machine is in state s {i.e. 
executing instruction number s) and the scanner is presently reading a 
c in cell i, then the machine M should 

• set tti = b {i.e. write b instead of c in the scanned cell), then 

• move the scanner to Oj+d {i.e. move one cell left ii d = —1 and 
one cell right if ci = 1), and then 

• enter state t {i.e. go to instruction t). 
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If our processor isn't equipped to handle input c for instruction s {i.e. 
M{s,c) is undefined), then the computation in progress will simply 
stop dead or halt. 

Example 10.2. We will usually present Turing machines in the 
form of a tabic, with a row for each state and a column for each possible 
entry in the scanned cell. Instead of —1 and 1, we will usually use L 
and R when writing such tables in order to make them more readable. 
Thus the table 



M 





1 


1 


1R2 


ORl 


2 


0L2 





defines a Turing machine M with two states such that M(1,0) = 
(1,1,2), M(l,l) = (0,1,1), and M(2, 0) = (0,-1,2), but M(2, 1) is 
undefined. In this case M has domain { (1, 0), (1, 1), (2, 0) } and range 
{ (1, 1, 2), (0, 1, 1), (0, -1, 2) }. If the machine M were faced with the 
tape position 

1: 01001111, 

it would, since it was in state 1 while scanning a cell containing 0, 

• write a 1 in the scanned cell, 

• move the scanner one cell to the right, and 

• go to state 2. 

This would give the new tape position 

2: 01011111. 

Since M doesn't know what to do on input 1 in state 2, it would then 
halt, ending the computation. 

Problem 10.3. In each case, give the table of a Turing machine 
M meeting the given requirement. 

(1) M has three states. 

(2) M changes to 1 and vice versa in any cell it scans. 

(3) M is as simple as possible. How many possibilities are there 

here? 

Computations. Informally, a computation is a sequence of actions 
of a machine M on a tape according to the rules above, starting with 
instruction 1 and the scanner at cell on the given tape. A computation 
ends (or halts) when and if the machine encounters a tape position 
which it does not know what to do in If it never halts, and doesn't 
crash by running the scanner off the left end of the tape^ either, the 



Be warned that most authors prefer to treat running the scanner off the left 
end of the tape as being just another way of halting. Halting with the scanner 
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computation will never end. The formal definition makes all this seem 
much more formidable. 

Definition 10.4. Suppose M is a Turing machine. Then: 

• If p = {s,i,a) is a tape position and M(s, Oj) = {b,d,t) is 
defined, then M(p) = {t, i+d, a') is the successor tape position, 
where a[ = h and a'^ = aj whenever j ^ i. 

• A partial computation with respect to M is a sequence piP2 ■ ■ ■ 
of tape positions such that pe+i = M(p£) for each i < k. 

• A partial computation pip2 ■ ■ - Pk with respect to M is a com- 
putation (with respect to M) with input tape a if pi = (1, 0, a) 
and M(pfc) is undefined (and not because the scanner would 
run off the end of the tape). The output tape of the computa- 
tion is the tape of the final tape position pk- 

Note that a partial computation is a computation only if the Turing 
machine halts but doesn't crash in the final tape position. The require- 
ment that it halt means that any computation can have only finitely 
many steps. Unless stated otherwise, we will assume that every partial 
computation on a given input begins in state 1. We will often omit 
the "partial" when speaking of computations that might not strictly 
satisfy the definition of computation. 

Example 10.3. Let's sec the machine M of Example 10.2 perform 
a computation. Our input tape will be a = 1100, that is, the tape 
which is entirely blank except that — ai — 1. The initial tape 
position of the computation of M with input tape a is: 

1 : 1100 

The subsequent steps in the computation are: 

1 : 0100 
1 : 0000 
2: 0010 
2: 001 

We leave it to the reader to check that this is indeed a partial com- 
putation with respect to M. Since M(2, 1) is undefined the process 
terminates at this point and this partial computation is therefore a 
computation. 



on the tape is more convenient, however, when putting together different Turing 
machines to make more complex ones. 
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Problem 10.4. Give the (partial) computation of the Turing ma- 
chine M of Example 10.2 starting in state 1 with the input tape: 

(1) 00 

(2) 110 

(3) The tape with all cells marked and cell 5 being scanned. 

Problem 10.5. For which possible input tapes does the partial com- 
putation of the Turing machine M of Example 10.2 eventually termi- 
nate? Explain why. 

Problem 10.6. Find a Turing machine that (eventually!) fills a 
blank input tape with the pattern 010110001011000101100 . . . . 

Problem 10.7. Find a Turing machine that never halts (or crashes), 

no matter what is on the tape. 

Building Turing Machines. It will be useful later on to have a 
library of Turing machines that manipulate blocks of Is in various ways, 
and very useful to be able to combine machines peforming simpler tasks 
to perform more complex ones. 

Example 10.4. The Turing machine S given below is intended to 
halt with output Ol'^O on input Ol'^, ilk > 0; that is, it just moves past 
a single block of Is without disturbing it. 



s 





1 


1 






2 




li?2 



Trace this machine's computation on, say, input 01^ to see how it works. 

The following machine, which is itself a variation on does the 
reverse of what S does: on input 01*^0 it halts with output 01*^. 



T 





1 


1 


0L2 




2 




1L2 



We can combine S and T into a machine U which does nothing to 
a block of Is: given input Ol'^ it halts with output 01*'. (Of course, a 
better way to do nothing is to really do nothing!) 



T 





1 


1 


0i?2 




2 


0L3 


IR2 


3 




1L3 



Note how the states of T had to be renumbered to make the combina- 
tion work. 
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Example 10.5. The Turing machine P given below is intended to 
move a block of Is: on input 00" l'^, where n > and A; > 0, it halts 
with output Ol'^. 



p 





1 


1 


0i?2 




2 


im 


1L8 


3 


0i?3 


om 


4 


0i?7 


1L5 


5 


0L5 


1R6 


6 


im 




7 


0L7 


1L8 


8 




ILS 



Trace P's computation on, say, input 00^1^ to see how it works. Trace 
it on inputs 01^ and 00^1 as well to see how it handles certain special 
cases. 

Note. In both Examples 10.4 and 10.5 we do not really care what 
the given machines do on other inputs, so long as they perform as 
intended on the particular inputs we are concerned with. 

Problem 10.8. We can combine the machine P of Example 10.5 
with the machines S and T of Example 10. 4 to get the following ma- 
chine. 



R 





1 


1 


0R2 




2 


0R3 


1R2 


3 


1R4: 


1L9 


4 


ORA 


ORB 


5 


0R8 


ILQ 


6 


0L6 


1R7 


7 


1R4 




8 


0L8 


1L9 


9 


OLIO 


1L9 


10 




ILIO 



What task involving blocks of Is is this machine intended to perform? 

Problem 10.9. In each case, devise a Turing machine that: 

(1) Halts with output 01^ on input 0. 

(2) Halts with output 01"0 on input 00" 1. 

(3) Halts with output 01^'^^ on input 01'^. 

(4) Halts with output 0(10)'^ on input 01'^. 

(5) Halts with output 01"* on input 01"01"* whenever n,m > 0. 
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(6) Halts with output m^'^QV'^V' on mj>n^ Or^Ol^Ol™, ifn,m,k> 
0. 

(7) Halts with output 01"*01"0l'=01"*01"0l'= on input 01"*01"01^ 

if n, rrijk > 0. 

(8) On input Ol^Ol", where m,n > 0, halts with output 01 if 
m ^ n and output Oil if m — n. 

It doesn't matter what the machine you define in each case may do on 
other inputs, so long as it does the right thing on the given one(s). 
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Variations and Simulations 

The definition of a Turing macliine given in Cliapter 10 is arbitrary 
in a number of ways, among tliem tlie use of tlie symbols and 1, a 
single read-write scanner, and a single one-way infinite tape. One could 
further restrict the definition we gave by allowing 

• the machine to move the scanner only to one of left or right in 
each state, 

or expand it by allowing the use of 

• any finite alphabet of at least two symbols, 

• separate read and write heads, 

• multiple heads, 

• two-way infinite tapes, 

• multiple tapes, 

• two- and higher-dimensional tapes, 

or various combinations of these, among many other possibilities. We 
will construct a number of Turing machines that simulate others with 
additional features; this will show that various of the modifications 
mentioned above really change what the machines can compute. (In 
fact, none of them turn out to do so.) 

Example 11.1. Consider the following Turing machine: 



M 





1 


1 


im 


OLl 


2 


0L2 


ILl 



Note that in state 1, this machine may move the scanner to ei- 
ther the left or the right, depending on the contents of the cell being 
scanned. We will construct a Turing machine using the same alpha- 
bet that emulates the action of M on any input, but which moves the 
scanner to only one of left or right in each state. There is no problem 
with state 2 of M, by the way, because in state 2 M always moves the 
scanner to the left. 

The basic idea is to add some states to M which replace part of the 
description of state 1. 
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1 


1 


1R2 


0R3 


2 


0L2 


ILl 


3 


0L4 


1L4 


4 


OLl 





This machine is just hke M except that in state 1 with input 1, 
instead of moving the scanner to the left and going to state 1, the 
machine moves the scanner to the right and goes to the new state 3. 
States 3 and 4 do nothing between them except move the scanner two 
cells to the left without changing the tape, thus putting it where M 
would have put it, and then entering state 1, as M would have. 

Problem 11.1. Compare the computations of the machines M and 
M' of Example 11.1 on the input tapes 

(1) 

(2) Oil 

and explain why is it not necessary to define M' for state 4 on input 1. 

Problem 11.2. Explain in detail how, given an arbitrary Turing 
machine M, one can construct a machine M' that simulates what M 
does on any input, hut which moves the scanner only to one of left or 
right in each state. 

It should be obvious that the converse, simulating a Turing machine 
that moves the scanner only to one of left or right in each state by an 
ordinary Turing machine, is easy to the point of being trivial. 

It is often very convenient to add additional symbols to the alphabet 
that Turing machines are permitted to use. For example, one might 
want to have special symbols to use as place markers in the course of 
a computation. (For a more spectacular application, see Example 11.3 
below.) It is conventional to include 0, the "blank" symbol, in an 
alphabet used by a Turing machine, but otherwise any finite set of 
symbols goes. 

Problem 11.3. How do you need to change Definitions 10.1 and 
10.3 to define Turing machines using a finite alphabet E? 

While allowing arbitary alphabets is often convenient when design- 
ing a machine to perform some task, it doesn't actually change what 
can, in principle, be computed. 

Example 11.2. Consider the machine W below which uses the 
alphabet {0, a;, |/, 2;}. 



w 





X 


y 


z 


1 


ORl 


xRl 


0L2 


zRl 
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For example, on input Oxzyxy, W will eventually halt with output 
OxzOxy. Note that state 2 of 1^ is used only to halt, so we don't bother 
to make a row for it on the table. 

To simulate W with a machine Z using the alphabet {0, 1}, we first 
have to decide how to represent W^s tape. We will use the following 
scheme, arbitrarily chosen among a number of alternatives. Every cell 
of W^s tape will be represented by two consecutive cells of Z's tape, 
with a on VT's tape being stored as 00 on Z's, an x as 01, a y as 10, 
and 11. Thus, if W had input tape Qxzyxy, the corresponding 

input tape for Z would be 000111100110. 

Designing the machine Z that simulates the action of W on the 
representation of VF's tape is a little tricky. In the example below, 
each state of W corresponds to a "subroutine" of states of Z which 
between them read the information in each representation of a cell of 
VF's tape and take appropriate action. 



z 





1 


1 


0R2 


1R3 


2 


OLA 


1L6 


3 


0L8 


1L13 


4 


0R5 




5 


ORl 




6 


0R7 




7 
8 




IRl 
0R9 


9 


OLIO 




10 


OLll 




11 


0L12 


1L12 


12 


0L15 


1L15 


13 




IRU 


14 




IRl 



States 1-3 of Z read the input for state 1 of 1^ and then pass on control 
to subroutines handling each entry for state 1 in 1^'s table. Thus states 
4-5 of Z take action for state 1 of on input 0, states 6-7 of Z take 
action for state 1 of on input x, states 8-12 of Z take action for 
state 1 of on input y, and states 13-14 take action for state 1 of W 
on input z. State 15 of Z does what state 2 of does: nothing but 
hah. 

Problem 11.4. Trace the (partial) computations ofW, and their 
counterparts for Z, for the input Oxzyxy for W . Why is the subroutine 
for state lofWon input y so much longer than the others? How much 
can you simplify it? 
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Problem 11.5. Given a Turing machine M with an arbitrary al- 
phabet explain in detail how to construct a machine N with alphabet 
{0, 1} that simulates M. 

Doing the converse of this problem, simulating a Turing machine 
with alphabet {0, 1} by one using an arbitrary alphabet, is pretty easy. 

To define Turing machines with two-way infinite tapes we need only 
change Definition 10.1: instead of having tapes a — aoaia2 ■ ■ ■ indexed 
by N, we let them be b = . . . 6_2&-i&o^i^2 • • ■ indexed by Z. In defining 
computations for machines with two-way infinite tapes, we adopt the 
same conventions that we did for machines with one-way infinite tapes, 
such as having the scanner start off scanning cell on the input tape. 
The only real difference is that a machine with a two-way infinite tape 
cannot crash by running off the left end of the tape; it can only stop 
by halting. 

Example 11.3. Consider the following two-way infinite tape Turing 
machine with alphabet {0, 1}: 



T 





1 


1 


ILl 


0R2 


2 


0R2 


ILl 



To emulate T with a Turing machine O that has a one-way infinite 
tape, we need to decide how to represent a two-way infinite tape on a 
one-way infinite tape. This is easier to do if we allow ourselves to use 
an alphabet for O other than {0, 1}, chosen with malice aforethought: 

roiooiiT. 

IS' S' 0' 1' 0' iJ 

Wc can now represent the tape a = . . . a_2a-iCoaiO'2 ... for T by the 
tape a' = "gO ^"_2 ... for O. In effect, this trick allows us to split O's 
tape into two tracks, each of which accomodates half of the tape of T. 

To define O, we split each state of T into a pair of states for O, 
one for the lower track and one for the upper track. One must take 
care to keep various details straight: when O changes a "cell" on one 
track, it should not change the corresponding "cell" on the other track; 
directions are reversed on the lower track; one has to "turn a corner" 
moving past cell 0; and so on. 



















1 


1 


1 


1 


ILl 


IR3 


ILl 


\Ll 


»/?2 


IR2 


\R2 


2 


IR2 


IR2 


IR2 


»i?2 


IR3 


ILl 


\Ll 


3 


°R3 


IR3 


°R3 


"LA 




IR2 


IR3 


ILA 


4 


0L4 


IR2 


"LA 




»i?3 


IR3 


ILA 


\R3 
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States 1 and 3 are the upper- and lower-track versions, respectively, 
of T's state 1; states 2 and 4 are the upper- and lower-track versions, 
respectively, of T's state 2. We leave it to the reader to check that O 
actually does simulate T. . . 

Problem 11.6. Trace the (partial) computations of T , and their 
counterparts for O, for each of the following input tapes for T: 

(1) (i.e. a blank tape) 

(2) 10 

(3) ... 1111111 . . . (I.e. every cell marked with 1 ) 

Problem 11.7. Explain in detail how, given a Turing machine N 
with alphabet E and a two-way infinite tape, one can construct a Turing 
machine P with an one-way infinite tape that simulates N. 

Problem 11.8. Explain in detail how, given a Turing machine P 
with alphabet S and an one-way infinite tape, one can construct a Tur- 
ing machine N with a two-way infinite tape that simulates P. 

Combining the techniques we've used so far, we could simulate any 
Turing machine with a two-way infinite tape and arbitrary alphabet by 
a Turing machine with a one-way infinite tape and alphabet {0, 1}. 

Problem 11.9. Give a precise definition for Turing machines with 
two tapes. Explain how, given any such machine, one could construct 
a single-tape machine to simulate it. 

Problem 11.10. Give a precise definition for Turing machines with 
two-dimensional tapes. Explain how, given any such machine, one 
could construct a single-tape machine to simulate it. 

These results, and others like them, imply that none of the variant 
types of Turing machines mentioned at the start of this chapter differ 
essentially in what they can, in principle, compute. 

In Chapter 14 we will construct a Turing machine that can simulate 
any (standard) Turing machine. 



CHAPTER 12 



Computable and Non-Computable Functions 

A lot of computational problems in the real world have to do with 
doing arithmetic, and any notion of computation that can't deal with 
arithmetic is unlikely to be of great use. 

Notation and conventions. To keep things as simple as pos- 
sible, we will stick to computations involving the natural numbers, 
i.e. the non-ncgativc integers, the set of which is usually denoted by 
N = { 0, 1, 2, . . . }.. The set of all A;-tuples (ni, . . . , Uk) of natural num- 
bers is denoted by N^. For all practical purposes, we may take to 
be N by identifying the 1-tuple (n) with the natural number n. 

For A; > 1, / is a k-place function (from the natural numbers to the 
natural numbers), often written as /: N*^ — > N, if it associates a value, 
/(ni, . . . , Uk), to each A;-tuple (ni, n2, . . . , Uk) € N''. Strictly speaking, 
though we will frequently forget to be explicit about it, we will often 
be working with A;-place partial functions which might not be defined 
for all the /c-tuples in N'^. If / is a /c- place partial function, the domain 
of / is the set 

dom(/) = { (ni, . . . e N'' | /(ni, . . . is defined} . 
Similarly, the range of / is the set 

ran(/) = { /(m, . . . , n^) G N | (ni, . . . , n^) G dom(/) } . 

In subsequent chapters wc will also work with relations on the nat- 
ural numbers. Recall that a k-place relation on N is formally a subset 
P of N^; P{ni, . . . , Uk) is true if (ni, . . . , n^) e P and false otherwise. 
In particular, a 1-place relation is really just a subset of N. 

Relations and functions are closely related. All one needs to know 
about a fc-place function / can be obtained from the {k + l)-place 
relation Pf given by 



Pf{ni, . . . ,nk,nk+i) <^=^ /(ni, . . . , rife) = nfe+i 
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Similarly, all one needs to know about the /c-place relation P can be 
obtained from its characteristic function : 



The basic convention for representing natural numbers on the tape 
of a standard Turing machine is a slight variation of unary notation : n 
is represented by 1"+^. (Why would using 1" be a bad idea?) A fc-tuple 
(ni, 712, . . . , rife) e N will be represented by I"i+i01"2+i0 . . . 01"* +^ i.e. 
with the representations of the individual numbers separated by Os. 
This scheme is inefficient in its use of space — compared to binary 
notation, for example — but it is simple and can be implemented on 
Turing machines restricted to the alphabet {!}. 

Turing computable functions. With suitable conventions for 
representing the input and output of a function on the natural numbers 
on the tape of a Turing machine in hand, we can define what it means 
for a function to be computable by a Turing machine. 

Definition 12.1. A /c-place function / is Turing computable, or 
just computable, if there is a Turing machine M such that for any 
/c-tuple (ni, . . . , nfe) e dom(/) the computation of M with input tape 
Oi"i+ioi"2+i . . . 01"*=+^ eventually halts with output tape 01-^("i' - '"*)+i. 
Such a machine M is said to compute f. 

Note that for a Turing machine M to compute a function /, M 
need only do the right thing on the right kind of input: what M docs 
in other situations does not matter. In particular, it does not matter 
what M might do with fc-tuple which is not in the domain of /. 

Example 12.1. The identity function i^: N — > N, i.e. if<i{n) = n, 
is computable. It is computed by M = 0, the Turing machine with an 
empty table that does absolutely nothing on any input. 

Example 12.2. The projection function ttJ : — > N given by 
Trf{n, m) — n is computed by the Turing machine: 




1 if P{ni, . . . , Uk) is true; 
if P{ni, . . . ,nk) is false. 








1 



1 
2 
3 
4 
5 



0R2 
0R3 
OLA 
0L4 



17?2 
0R3 
1L5 
1L5 
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Pi acts as follows: it moves to the right past the first block of Is 
without disturbing it, erases the second block of Is, and then returns 
to the left of first block and halts. 

The projection function 7r| : ^ N given by 7r|(n, m) = m is also 
computable: the Turing machine P of Example 10.5 does the job. 

Problem 12.1. Find Turing machines that compute the following 
functions and explain how they work. 



We will consider methods for building functions computable by Tur- 
ing machines out of simpler ones later on. 

A non-computable function. In the meantime, it is worth ask- 
ing whether or not every function on the natural numbers is com- 
putable. No such luck! 

Problem 12.2. Show that there is some 1-place function / : N — > N 
which is not computable by comparing the number of such functions to 
the number of Turing machines. 

The argument hinted at above is unsatisfying in that it tells us there 
is a non-computable function without actually producing an explicit 
example. We can have some fun on the way to one. 

Definition 12.2 (Busy Beaver Competition). A machine M is an 
n-state entry in the busy beaver competition if: 

• M has a two-way infinite tape and alphabet {1} (see Chap- 
ter 11; 

• M has n+1 states, but state n -I- 1 is used only for halting (so 
both M{n + 1, 0) and M{n + 1, 1) are undefined); 

• M eventually halts when given a blank input tape. 

M's score in the competition is the number of I's on the output tape of 
its computation from a blank input tape. The greatest possible score 
of an n-state entry in the competition is denoted by E(n). 



(1) 
(2) 
(3) 



0(n) = 0. 
S(n) =n + 1. 
SuM(n, m) —n + m. 




(6) 
(7) 



7r|(p,g,r) = q. 
7rf(ai, ...,ai,. 
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Note that there are only finitely many possible n-statc entries in the 
busy beaver competition because there are only finitely many (n + 1)- 
statc Tiiring machines with alphabet {1}. Since there is at least one 
n-statc entry in the busy beaver competition for every n > , it follows 
that is well-defined for each n G N. 

Example 12.3. M = is the only 0-state entry in the busy beaver 
competition, so S(0) = 0. 

Example 12.4. The machine P given by 



p 





1 


1 


1R2 


1L2 


2 


ILl 


1L3 



is a 2-state entry in the busy beaver competition with a score of 4, so 
E(2) > 4. 

The function S grows extremely quickly. It is known that S(0) = 0, 
S(l) = 1, S(2) = 4, S(3) = 6, and S(4) = 13. The value of E(5) is 
still unknown, but must be quite large. ^ 

Problem 12.3. Show that: 

(1) The 2-state entry given in Example 12.4 actually scores 4. 

(2) E(l) = 1. 

(3) S(3) > 6. 

(4) E(n) < E(n + 1) for every n eN. 

Problem 12.4. Devise as high-scoring 4- and 5-state entries in the 
bvsy beaver competition as you can. 

The serious point of the busy beaver competition is that the func- 
tion E is not a Turing computable function. 

Proposition 12.5. E is not computable by any Turing machine. 

Anyone interested in learning more about the busy beaver com- 
petition should start by reading the paper [16] in which it was first 
introduced. 

Building more computable functions. One of the most com- 
mon methods for assembling functions from simpler ones in many parts 
of mathematics is composition. It turns out that compositions of com- 
putable functions are computable. 

^The best score known to the author by a 5-state entry in the busy beaver 

competition is 4098. One of the two machines achieving this score docs so in a 
computation that takes over 40 milhon steps! The other requires only 11 milhon or 
so. . . 
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Definition 12.3. Suppose that m, k > 1, g is an m-place function, 
and hi, . . . , hm are A;- place functions. Then the /c- place function / is 
said to be obtained from g, hi, . . . , hm hy composition, written as 

f ^ g o {hi,. . . ,hm) : 

if for all {ni,...,nk) G N'', 

f{ni, . . . , nfe) = g{hi{ni, Uk), hm{ni, nu)). 

Example 12.5. The constant function c\, where c\{n) — 1 for all 
n, can be obtained by composition from the functions S and O. For 

any n e N, 

c\{n) = (S o 0)(n) = S(0(n)) = S(0) = + 1 = 1. 

Problem 12.6. Suppose k > 1 and a G N. Use composition 
to define the constant function c^, where c^(ni, . . . , n^) = a for all 
(ni, . . . , Uk) G N'' , from functions already known to be computable. 

Proposition 12.7. Suppose that 1 < k, 1 < m, g is a Turing 
computable m-place function, and hi, hm are Turing computable 
k-place functions. Then g o [hi, . . . , hm) is also Turing computable. 

Starting with a small set of computable functions, and applying 
computable ways (such as composition) of building functions from sim- 
pler ones, we will build up a useful collection of computable functions. 
This will also provide a characterization of computable functions which 
does not mention any type of computing device. 

The "small set of computable functions" that will be the fundamen- 
tal building blocks is infinite only because it includes all the projection 
functions. 

Definition 12.4. The following are the initial functions: 

• O, the 1-place function such that 0(n) =0 for all n G N; 

• S, the 1-place function such that S(n) — n + 1 for all n G N; 
and, 

• for each k > 1 and 1 < i < k, Trf , the A;-place function such 
that 7rf (ni, . . . , Uk) = n-i for all {rii, . . . , Uk) G N'^. 

O is often referred to as the zero function, S is the successor function, 
and the functions 7rf are called the projection functions. 

Note that n\ is just the identity function on N. 

We have already shown, in Problem 12.1, that all the initial func- 
tions are computable. It follows from Proposition 12.7 that every func- 
tion defined from the initial functions using composition (any number 
of times) is computable too. Since one can build relatively few func- 
tions from the initial functions using only composition. . . 
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Proposition 12.8. Suppose f is a 1-place function obtained from 
the initial functions by finitely many applications of composition. Then 
there is a constant c e N such that f{n) <n-\-c for all n e N. 

... in the next chapter we will add other methods of building func- 
tions to our repertoire that will allow us to build all computable func- 
tions from the initial functions. 



CHAPTER 13 



Recursive Functions 

We will add two other methods of building computable functions 
from computable functions to composition, and show that one can use 
the three methods to construct all computable functions on N from the 
initial functions. 

Primitive recursion. The second of our methods is simply called 
recursion in most parts of mathematics and computer science. His- 
torically, the term "primitive recursion" has been used to distinguish 
it from the other recursive method of defining functions that we will 
consider, namely unbounded minimalization. ... Primitive recursion 
boils down to defining a function inductively, using different functions 
to tell us what to do at the base and inductive steps. Together with 
composition, it suffices to build up just about all familiar arithmetic 
functions from the initial functions. 

Definition 13.1. Suppose that k > 1, g is a A;-placc function, and 
h is a. k + 2-place function. Let / be the {k + l)-place function such 
that 

(1) /(ni, . . . , nfc, 0) = g{ni, ...,nk) and 

(2) /(ni, . . . , nfc, m + 1) = h {rii, . . . ,nk,m, f{ni, ...,nk, m)) 

for every (ni, . . . , Uk) G N'^ and m e N. Then / is said to be obtained 
from g and h by primitive recursion. 

That is, the initial values of / are given by ^f, and the rest are given 
by h operating on the given input and the preceding value of /. 

For a start, primitive recursion and composition let us define addi- 
tion and multiplication from the initial functions. 

Example 13.1. SUM(n, m) = n + m is obtained by primitive recur- 
sion from the initial function tt^ and the composition S o 7r| of initial 
functions as follows: 

• SuM(n, 0) = Ti\{n); 

• SuM(n, m -M) = (S o 7r|)(n, m, SuM(n, m)). 

To see that this works, one can proceed by induction on m: 
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At the base step, m = 0, we have 

SUM(n, 0) = 7r}(n) = n = n + . 
Assume that m > and SUM(n, m) = n + m. Then 
SuM(n, m + 1) = (S o7r|)(n,m, SuM(n,m)) 
= S(7r|(n, m, SuM(n, m))) 
= S(SuM(n,m)) 
= SUM(n,m) + 1 
= n + m + 1 , 

as desired. 

As addition is to the successor function, so multiphcation is to 
addition. 

Example 13.2. MuLT(n, m) = nm is obtained by primitive recur- 
sion from O and Sum o (vrf, tt^): 

• MuLT(n,0) = 0(n); 

• MuLT(n, m + 1) = (Sum o (7r|, 7rf))(n, m, MuLT(n, m)). 
We leave it to the reader to check that this works. 

Problem 13.1. Use composition and primitive recursion to obtain 
each of the following functions from the initial functions or other func- 
tions already obtained from the initial functions. 

(1) Exp(n, m) = n"* 

(2) PRED(n) (defined in Problem 12.1) 

(3) DlFF(n, rn) (defined in Problem 12.1) 

(4) FACT(n) = n\ 

Proposition 13.2. Suppose k > 1, g is a Turing computable k- 
place function, and h is a Turing computable {k + 2)-place function. If 
f is obtained from g and h by primitive recursion, then f is also Turing 
computable. 

Primitive recursive functions and relations. The collection of 
functions which can be obtained from the initial functions by (possibly 
repeatedly) using composition and primitive recursion is useful enough 
to have a name. 

Definition 13.2. A function / is primitive recursive if it can be 
defined from the initial functions by finitely many applications of the 
operations of composition and primitive recursion. 

So we already know that all the initial functions, addition, and 
multiplication, among others, are primitive recursive. 



13. RECURSIVE FUNCTIONS 89 

Problem 13.3. Show that each of the following functions is prim- 
itive recursive. 

(1) For any k > and primitive recursive {k + 1) -place function 
g, the {k + l)-place function f given by 

/(ni, . . . , nfe, m) = n^o5r(ni, . . . , n^, i) 

= g{ni, . . . ,nk,0) ■ . . . • g{ni, ...,nk,m). 



(2) For any constant a e X{a}\ 



n] 



a 

1 n — a . 



/on;,/ n J/(^l,---,^fc) (ni,...,nfc) 7^ (ci,...,Cfc) 

ya = (ci,...,Cfej 

/ is a primitive recursive k-place function and a,ci, . . . ,Ck G N 
are constants. 

Theorem 13.4. Every primitive recursive function is Turing com- 
putable. 

Be warned, however, that there are computable functions which are 
not primitive recursive. 

We can extend the idea of "primitive recursive" to relations by using 
their characteristic functions. 

Definition 13.3. Suppose A; > 1. A k-place relation P C N'' is 
primitive recursive if its characteristic function 

1 (m, ...,nk)eP 



Xp{ni,...,nk) = 



{ni, . . . ,nk) i P 



is primitive recursive. 

Example 13.3. P = {2} c N is primitive recursive since X{2} is 
recursive by Problem 13.3. 

Problem 13.5. Show that the following relations and functions are 
primitive recursive. 

(1) -iP, i.e. N'' \ P, if P is a primitive recursive k-place relation. 

(2) P y Q, i.e. P UQ, if P and Q are primitive recursive k-place 

relations. 

(3) P A Q, i.e. P nQ, if P and Q are primitive recursive k-place 
relations. 

(4) Equal, w/iere EQUAL(n,m) 4=^ n = m. 

(5) /i(ni, . . . , nfc, m) = Y.T=q di^i, ...,nk,i), for any k > and 
primitive recursive {k + 1) -place function g. 

(6) DiV; where Div(n, m) <^=^ n \ m. 
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(7) IsPrime, where IsPRlME(n) <^=^ n is prime. 

(8) Prime(/c) = pk, where po — 1 andpk is the kth prime ifk > 1. 

(9) PoWER(n, m) = k, where k >0 is maximal such that \ m. 

(10) LENGTH(n) = i, where i is maximal such that pe \ n. 

(11) ELEMENT(n, i) = rii, if n = p^^ . . .p^^ (and Ui = if i > k). 

(12) SuBSEQ(n, i, j) = • • • ^fl<^<J<k^ 

I otherwise 

ever n — p"^ . . . p^* . 

(13) CONCAT(n, m) = . . .p^'^p^i . . .p^l,, ifn = . . .p^'= and 
m = p7*i ...pf^ 

Parts of Problem 13.5 give us tools for representing finite sequences 
of integers by single integers, as well as some tools for manipulating 
these representations. This lets us reduce, in principle, all problems 
involving primitive recursive functions and relations to problems in- 
volving only 1-place primitive recursive functions and relations. 

Theorem 13.6. A k-place g is primitive recursive if and only if 
the 1-place function h given by h{n) — g{ni, . . . , n^) if n — p"^ . . .p^* 
is primitive recursive. 

Note. It doesn't matter what the function h may do on an n which 
does not represent a sequence of length k. 

Corollary 13.7. A k-place relation P is primitive recursive if and 
only if the 1-place relation P' is primitive recursive, where 

(ni,...,nfe) eP ^ p^.-.p^ e^'- 

A computable but not primitive recursive function. While 

primitive recursion and composition do not quite suffice to build all 
Turing computable functions from the initial functions, they are pow- 
erful enough that specific counterexamples are not all that easy to find. 

Example 13.4 (Ackerman's Function). Define the 2- place function 
A from as follows: 

• A{0,i) = S{i) 

• A(S(A;),0) = A(A;,1) 

• A(slk), S{e)) ^ A{k,A(S(k),e)) 

Given A, define the 1-place function a by a{n) = A(n, n). 

It isn't too hard to show that A, and hence also a, are Turing 
computable. However, though it takes considerable effort to prove it, 
a grows faster with n than any primitive recursive function. (Try 
working out the first few values of a. . . ) 
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Problem 13.8. Show that the functions A and a defined in Exam- 
ple 13.4 are Turing computable. 

If you are very ambitious, you can try to prove the following theo- 
rem. 

Theorem 13.9. Suppose a is the function defined in Example 13.4 
and f is any primitive recursive function. Then there is ann such 
that for all k > n, a{k) > f{k). 

Corollary 13.10. The function a defined in Example 13.4 '<^ot 
primitive recursive. 

. . . but if you aren't, you can still try the following exercise. 

Problem 13.11. Informally, define a computable function which 
must be different from every primitive recursive function. 

Unbounded minimalization. The last of our three method of 
building computable functions from computable functions is unbounded 
minimalization. The functions which can be defined from the initial 
functions using unbounded minimalization, as well as composition and 
primitive recursion, turn out to be precisely the Turing computable 
functions. 

Unbounded minimalization is the counterpart for functions of "brute 
force" algorithms that try every possibility until they succeed. (Which, 
of course, they might not. . . ) 

Definition 13.4. Suppose A; > 1 and g is & {k + l)-place func- 
tion. Then the unbounded minimalization of g is the /c-place function 
/ defined by 

/(ni, . . . , Uk) — m where m is least so that g{ni, . . . , rife, m) — 0. 
This is often written as /(rii, . . . , Uk) — ^m[g{ni, . . . , n^, m) = 0]. 

Note. If there is no m such that g{ni, . . . ,nk,m) = 0, then the 
unbounded minimalization of g is not defined on {ni, . . . ,nk). This is 
one reason we will occasionally need to deal with partial functions. 

If the unbounded minimalization of a computable function is to be 
computable, we have a problem even if we ask for some default out- 
put (0, say) to ensure that it is defined for all A;-tuples. The obvious 
procedure which tests successive values of g to find the needed m will 
run forever if there is no such m, and the incomputability of the Halt- 
ing Problem suggests that other procedure's won't necessarily succeed 
either. It follows that it is desirable to be careful, so far as possible, 
which functions unbounded minimalization is applied to. 
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Definition 13.5. A [k + l)-place function g is said to be regular 
if for every e N'^, there is at least one m e N so that 

g{ni,...,nk,m) = 0. 

That is, g is regular precisely if the obvious strategy of computing 
g{ni, . . . , rik, m) for m = 0, 1, . . . in succession until an m is found with 
girii, . . . ,nk,m) = always succeeds. 

Proposition 13.12. If g is a Turing computable regular {k + 1)- 
place function, then the unbounded minimalization of g is also Turing 
computable. 

While unbounded minimalization adds something essentially new to 
our repertoire, it is worth noticing that hounded minimalization does 
not. 

Problem 13.13. Suppose g is a {k + l)-place primitive recursive 
regular function such that for some primitive recursive k-place function 
h, 

lim[g{ni, . . . , n^, m) = 0] < h{ni, ...,nk) 
for all {rii, . . . , rik) G N. Show that iim[g{ni, . . . , rik, m) — 0] is also 
primitive recursive. 

Recursive functions and relations. We can finally define an 
equivalent notion of computability for functions on the natural numbers 
which makes no mention of any computational device. 

Definition 13.6. A A;- place function / is recursive if it can be 
defined from the initial functions by finitely many applications of com- 
position, primitive recursion, and the unbounded minimalization of 
regular functions. 

Similarly, /c-place partial function is recursive if it can be defined 
from the initial functions by finitely many applications of composition, 
primitive recursion, and the unbounded minimalization of (possibly 
non-regular) functions. 

In particular, every primitive recursive function is a recursive func- 
tion. 

Theorem 13.14. Every recursive function is Turing computable. 

We shall show that every Turing computable function is recursive 
later on. Similarly to primitive recursive relations we have the follow- 
ing. 

Definition 13.7. A /c-place relation P is said to be recursive {Tur- 
ing computable) if its characteristic function xp is recursive (Turing 
computable) . 
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Since every recursive function is Turing computable, and vice versa, 
"recursive" is just a synonym of "Turing computable" , for functions and 
relations alike. 

Also, similarly to Theorem 13.6 and Corollary 13.7 we have the 
following. 

Theorem 13.15. A k-place function g is recursive if and only if 
the 1-place function h given by h{n) = g{ni, . . . , rik) if n = p"^ . . .p^* 
is recursive. 

As before, it doesn't really matter what the function h does on an 
n which does not represent a sequence of length k. 

Corollary 13.16. A k-place relation P is recursive if and only if 
the 1-place relation P' is recursive, where 

(n,,...,n,)eP ^ pT---pT^P'- 
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Characterizing Computability 

By putting together some of the ideas in Chapters 12 and 13, we can 
use recursive functions to simulate Turing machines. This will let us 
show that Turing computable functions are recursive, completing the 
argument that Turing machines and recursive functions are essentially 
equivalent models of computation. We will also use these techniques to 
construct an universal Turing machine (or UTM): a machine U that, 
when given as input (a suitable description of) some Turing machine M 
and an input tape a for M, simulates the computation of M on input 
a. In effect, an universal Turing machine is a single piece of hardware 
that lets us treat other Turing machines as software. 

Turing computable functions are recursive. Our basic strat- 
egy is to show that any Turing machine can be simulated by some 
recursive function. Since recursive functions operate on integers, we 
will need to encode the tape positions of Turing machines, as well as 
Turing machines themselves, by integers. For simplicity, we shall stick 
to Turing machines with alphabet {1}; we already know from Chap- 
ter 11 that such machines can simulate Turing machines with bigger 
alphabets. 

Definition 14.1. Suppose (s,i,a) is a tape position such that all 
but finitely many cells of a are blank. Let n be any positive integer 
such that afe = for all k > n. Then the code of (s, i, a) is 

^(s,i,a)^ = 2"3*5"°7"ill"2 . . .pl^^ . 

Example 14.1. Consider the tape position (2, 1, 1001). Then 

^(2, 1, 1001)^ = 223^5^7°11°13^ ^ 780. 

Problem 14.1. Find the codes of the following tape positions. 

(1) (1,0, a), where a is entirely blank. 

(2) (4, 3, a), where a is 1011100101. 

Problem 14.2. What is the tape position whose code is 10314720? 

When dealing with computations, we will also need to encode se- 
quences of tape positions by integers. 
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Definition 14.2. Suppose tit2 ... t„ is a sequence of tape positions. 
Then the code of this sequence is 

Note. Both tape positions and sequences of tape positions have 
unique codes. 

Problem 14.3. Pick some (short!) sequence of tape positions and 
find its code. 

Having defined how to represent tape positions as integers, we now 
need to manipulate these representations using recursive functions. 
The recursive functions and relations in Problems 13.3 and 13.5 provide 
most of the necessary tools. 

Problem 14.4. Show that both of the following relations are prim- 
itive recursive. 

(1) TapePos, where TAPEPos(n) <^=^ n is the code of a tape 
position. 

(2) TapePosSeq, where TAPEPosSEQ(n) ■^=^ n is the code of 
a sequence of tape positions. 

Problem 14.5. Show that each of the following is primitive recur- 
sive. 

(1) The A-place function Entry such that 

ENTRY(j, UJjt,^) 

'^{t, i + a!y ifn = I, a)^, j G {0, 1}, 

_ ^ w e {0,2}, i + w -1>Q, andt>l, 

where a'^ = Ofe for k ^ i and a[ = j; 
otherwise. 

(2) For any Turing machine M with alphabet {1}, the 1-place 
function StePm such that 

{'"M(s,i,a)"' if n = ^{s,i,Si)^ and 
M(s,i,a) is defined; 
otherwise. 

(3) For any Turing machine M with alphabet the 1-place re- 
lation ComPm; where 

ComPm(?T') n is the code of a computation of M. 
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The functions and relations above may be primitive recursive, but 
the last big step in showing that Turing computable functions are re- 
cursive requires unbounded minimalization. 

Proposition 14.6. For any Turing machine M with alphabet {1}, 
the 1-place (partial) function SiMm is recursive, where 



if n = '"(1,0, a)"' for some input tape a and M eventually halts in 
position {t,j,h) on input a. (Note that SlMM{n) may be undefined if 
n 7^ '"(1, 0, a)"' for an input tape a., or if M does not eventually halt on 
input a..) 

Lemma 14.7. Show that the following functions are primitive re- 
cursive: 

(1) For any fixed k > 1, CODEfc(ni, . . . , nfc) = ^(1, 0, Ol^iQ . . . 01"'=) 

(2) DECODE(t) = n if t = '~(s,i,01"+^)"' (and anything you tike 
otherwise). 

Theorem 14.8. Any k-place Turing computable function is recur- 
sive. 

Corollary 14.9. A function f -.N^ ^ N is Turing computable if 
and only if it is recursive. 

Thus Turing machines and recursive functions are essentially equiv- 
alent models of computation. 

An universal Turing machine. One can push the techniques 
used above little farther to get a recursive function that can simulate 
any Turing machine. Since every recursive function can be computed 
by some Turing machine, this effectively gives us an universal Turing 
machine. 

Problem 14.10. Devise a suitable definition for the code '"M"' of 
a Turing machine M with alphabet {1}. 

Problem 14.11. Show, using your definition of^M^ from Problem 
14-10, that the following are primitive recursive. 

(1) The 2-place function Step, where 



SiMmH =^(i,j,b)^ 




'"M(s, i, a)"' if m — '"M"' for some machine M, 

n — '"(s, i, a)"'; & M(s, i, a) is defined; 
otherwise. 
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(2) The 2-place relation COMP, where 

COMP(m,n) <^=^ m = '"M"' 

for some Turing machine M and n is the code of a computation 
ofM. 

Proposition 14.12. The 2-place (partial) function Sim is recur- 
sive, where, for any Turing machine M with alphabet {1} and input 
tape a for M, 

SiMrM^,^(l,0,ar)=^(i,j,br 

if M eventually halts in position {t, j, b) on input a. (Note that SlM(m, n) 

may be undefined if m is not the code of some Turing machine M, or 
if n ^ '^(1,0, a)"' for an input tape a, or if M does not eventually halt 
on input sl.) 

Corollary 14.13. There is a Turing machine U which can simu- 
late any Turing machine M. 

Corollary 14.14. There is a recursive function f which can com- 
pute any other recursive function. 

The Halting Problem. An effective method to determine whether 
or not a given machine wiU eventuaUy halt on a given input — short 
of waiting forever! — would be nice to have. For example, assuming 
Church's Thesis is true, such a method would let us identify computer 
programs which have infinite loops before we attempt to execute them. 

The Halting Problem. Given a Turing machine M and an in- 
put tape a, is there an effective method to determine whether or not 
M eventually halts on input a? 

Given that we are using Turing machines to formalize the notion 
of an effective method, one of the difficulties with solving the Halting 
Problem is representing a given Turing machine and its input tape as 
input for another machine. As this is one of the things that was accom- 
plished in the course of constructing an universal Turing machine, we 
can now formulate a precise version of the Halting Problem and solve 
it. 

The Halting Problem. Is there a Turing machine T which, for 
any Turing machine M with alphabet {1} and tape a for M, halts on 
input 

Q2'"M"'+lg-j^'"(l,0,a)"'+l 

with output oil if M halts on input a, and with output 01 if M does 
not halt on input a? 



14. CHARACTERIZING COMPUTABILITY 



99 



Note that this precise version of the Halting Problem is equivalent 
to the informal one above only if Church's Thesis is true. 

Problem 14.15. Show that there is a Turing machine C which, for 
any Turing machine M with alphabet on input 

g-|^'"M"'+l 

eventually halts with output 

Theorem 14.16. The answer to (the precise version of) the Halting 
Problem is "No. " 

Recursively enumerable sets. The following notion is of partic- 
ular interest in the advanced study of computability. 

Definition 14.3. A subset {i.e. a 1-place relation) P of N is re- 
cursively enumerable, often abbreviated as r.e., if there is a 1-place 
recursive function / such that P — im(/) = { f{n) | n e N }. 

Since the image of any recursive 1-place function is recursively enu- 
merable by definition, we do not lack for examples. For one, the set E 
of even natural numbers is recursively enumerable, since it is the image 

of f{n) = MuLT(S(S(0(n))), n). 

Proposition 14.17. If P is a 1-place recursive relation, then P is 
recursively enumerable. 

This proposition is not reversible, but it does come close. 

Proposition 14.18. P C N zs recursive if and only if both P and 

N\P are recursively enumerable. 

Problem 14.19. Find an example of a recursively enumerable set 
which is not recursive. 

Problem 14.20. /s P C N primitive recursive if and only if both 
P and N \ P are enumerable by primitive recursive functions? 

Problem 14.21. P C N recursively enumerable if and only if there 
is a 1-place recursive partial function g such that P — dom{g) = {n \ 
g{n) is defined} 
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Hints for Chapter 10. 

10.1. This should be easy. . . 

10.2. Ditto. 

10.3. (1) Any machine with the given alphabet and a table 
with three non-empty rows will do. 

(2) Every entry in the table in the column must write a 1 in the 
scanned cell; similarly, every entry in the 1 column must write 
a in the scanned cell. 

(3) What's the simplest possible table for a given alphabet? 

10.4. Unwind the definitions step by step in each case. Not all of 
these are computations. . . 

10.5. Examine your solutions to the previous problem and, if nec- 
essary, take the computations a little farther. 

10.6. Have the machine run on forever to the right, writing down 
the desired pattern as it goes no matter what may be on the tape 
already. 

10.7. Consider your solution to Problem 10.6 for one possible ap- 
proach. It should be easy to find simpler solutions, though. 

10.8. Consider the tasks S and T are intended to perform. 

10.9. (1) Use four states to write the Is, one for each. 

(2) The input has a convenient marker. 

(3) Run back and forth to move one marker n cells from the block 
of I's while moving another through the block, and then fill in. 

(4) Modify the previous machine by having it delete every other 
1 after writing out 1^". 

(5) Run back and forth to move the right block of Is cell by cell 
to the desired position. 

(6) Run back and forth to move the left block of Is cell by cell 
past the other two, and then apply a minor modification of 
the machine in part 5. 
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(7) Variations on the ideas used in part 6 should do the job. 

(8) Run back and forth between the blocks, moving a marker 
through each. After the race between the markers to the ends 
of their respective blocks has been decided, erase everything 
and write down the desired output. 

Hints for Chapter 11. 

11.1. This ought to be easy. 

11.2. Generalize the technique of Example 11.1, adding two new 
states to help with each old state that may cause a move in different 
directions. You do have to be a bit careful not to make a machine that 
would run off the end of the tape when the original would not. 

11.3. You only need to change the parts of the definitions involving 
the symbols and 1. 

11.4. If you have trouble figuring out whether the subroutine of Z 
simulating state 1 of on input y, try tracing the partial computations 
of W and Z on other tapes involving y. 

11.5. Generalize the concepts used in Example 11.2. Note that the 
simulation must operate with coded versions of Ms tape, unless S = 
{1}. The key idea is to use the tape of the simulator in blocks of some 
fixed size, with the patterns of Os and Is in each block corresponding 
to elements of E. 

11.6. This should be straightforward, if somewhat tedious. You do 
need to be careful in coming up with the appropriate input tapes for 
O. 

11.7. Generalize the technique of Example 11.3, splitting up the 
tape of the simulator into upper and lower tracks and splitting each 
state of N into two states in P. You will need to be quite careful in 
describing just how the latter is to be done. 

11.8. This is mostly pretty easy. The only problem is to devise iV 
so that one can tell from its output whether P halted or crashed, and 
this is easy to indicate using some extra symbol in A^s alphabet. 

11.9. If you're in doubt, go with one read/write scanner for each 
tape, and have each entry in the table of a two-tape machine take 
both scanners into account. Simulating such a machine is really just a 
variation on the techniques used in Example 11.3. 
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11.10. Such a machine should be able to move its seamier to cells 
up and down from the current one, as well to the side. (Diagonally too, 
if you want to!) Simulating such a machine on a single tape machine is 
a challenge. You might find it easier to first describe how to simulate 
it on a suitable multiple-tape machine. 

Hints for Chapter 12. 

12.1. (1) Delete most of the input. 

(2) Add a one to the far end of the input. 

(3) Add a little to the input, and delete a little more elsewhere. 

(4) Delete a little from the input most of the time. 

(5) Run back and forth between the two blocks in the input, delet- 
ing until one side disappears. Clean up appropriately! (This 
is a relative of Problem 10.9.8.) 

(6) Delete two of blocks and move the remaining one. 

(7) This is just a souped-up version of the machine immediately 
preceding. . . 

12.2. There are just as many functions N N as there are real 
numbers, but only as many Turing machines as there are natural num- 
bers. 

12.3. (1) Trace the computation through step-by-step. 

(2) Consider the scores of each of the 1-state entries in the busy 
beaver competition. 

(3) Find a 3-state entry in the busy beaver competition which 
scores six. 

(4) Show how to turn an n-state entry in the busy beaver compe- 
tition into an (n + l)-state entry that scores just one better. 

12.4. You could start by looking at modifications of the 3-state 
entry you devised in Problem 12.3.3, but you will probably want to do 
some serious fiddling to do better than what Problem 12.3.4 do from 
there. 

12.5. Suppose E was computable by a Turing machine M. Modify 
M to get an n-state entry in the busy beaver competition for some 
n which achieves a score greater than E(n). The key idea is to add 

a "pre-processor" to M which writes a block with more Is than the 
number odf states that M and the pre-processor have between them. 

12.6. Generalize Example 12.5. 
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12.7. Use machines computing g, hi, . . . , hm as sub-machines of 
the machine computing the composition. You might also find sub- 
machines that copy the original input and various stages of the output 
useful. It is important that each sub-machine get all the data it needs 
and does not damage the data needed by other sub-machines. 

12.8. Proceed by induction on the number of applications of com- 
position used to define / from the initial functions. 

Hints for Chapter 13. 

13.1. (1) Exponentiation is to multiplication as multiplication 
is to addition. 

(2) This is straightforward except for taking care of Pred(O) — 
Pred(1) = 0. 

(3) DiFF is to Pred as S is to Sum. 

(4) This is straightforward if you let 0! = 1. 

13.2. Machines used to compute g and h are the principal parts 
of the machine computing /, along with parts to copy, move, and/or 
delete data on the tape between stages in the recursive process. 

13.3. (1) / is to 5^ as Fact is to the identity function. 

(2) Use DiFF and a suitable constant function as the basic building 
blocks. 

(3) This is a slight generalization of the preceding part. 

13.4. Proceed by induction on the number of applications of prim- 
itive recursion and composition. 

13.5. (1) Use a composition including DiFF, xpj ^ suit- 
able constant function. 

(2) A suitable composition will do the job; it's just a little harder 
than it looks. 

(3) A suitable composition will do the job; it's rather more straight- 
forward than the previous part. 

(4) Note that n = m exactly when n — m = = m — n. 

(5) Adapt your solution from the first part of Problem 13.3. 

(6) First devise a characteristic function for the relation 



and then sum up. 

(7) Use xdiv and sum up. 

(8) Use IsPrime and some ingenuity. 

(9) Use Exp and Div and some more ingenuity. 

(10) A suitable combination of Prime with other things will do. 
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(11) A suitable combination of Prime and Power will do. 

(12) Throw the kitchen sink at this one. . . 

(13) Ditto. 

13.6. In each direction, use a composition of functions already 
known to be primitive recursive to modify the input as necessary. 

13.7. A straightforward application of Theorem 13.6. 

13.8. This is not unlike, though a little more complicated than, 
showing that primitive recursion preserves computability. 

13.9. It's not easy! Look it up. . . 

13.10. This is a very easy consequence of Theorem 13.9. 

13.11. Listing the definitions of all possible primitive recursive 
functions is a computable task. Now borrow a trick from Cantor's 
proof that the real numbers are uncountable. (A formal argument to 
this effect could be made using techniques similar to those used to show 
that all Turing computable functions are recursive in the next chapter.) 

13.12. The strategy should be easy. Make sure that at each stage 
you preserve a copy of the original input for use at later stages. 

13.13. The primitive recursive function you define only needs to 
check values of g{ni, . . . , n^, m) for m such that < m < h{ni, . . . , n^), 
but it still needs to pick the least m such that g{ni, . . . , Uk, m) — 0. 

13.14. This is very similar to Theorem 13.4. 

13.15. This is virtually identical to Theorem 13.6. 

13.16. This is virtually identical to Corollary 13.7. 
Hints for Chapter 14. 

14.1. Emulate Example 14.1 in both parts. 

14.2. Write out the prime power expansion of the given number 
and unwind Definition 14.1. 

14.3. Find the codes of each of the positions in the sequence you 
chose and then apply Definition 14.2. 

14.4. (1) XtapePos(?T') = 1 exactly when the power of 2 in the 

prime power expansion of n is at least 1 and every other prime 
appears in the expansion with a power of or 1. This can 
be achieved with a composition of recursive functions from 
Problems 13.3 and 13.5. 
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(2) XtapePosSeq(^) = 1 exactly when n is the code of a sequence of 
tape positions, i.e. every power in the prime power expansion 
of n is the code of a tape position. 

14.5. ( 1 ) If the input is of the correct form, make the necessary 
changes to the prime power expansion of n using the tools in 
Problem 13.5. 

(2) Piece StePm together by cases using the function Entry in 
each case. The piecing-together works a lot like redefining a 
function at a particular point in Problem 13.3. 

(3) If the input is of the correct form, use the function StePm 
to check that the successive elements of the sequence of tape 
positions are correct. 

14.6. The key idea is to use unbounded minimalization on Xcomp, 
with some additions to make sure the computation found (if any) starts 
with the given input, and then to extract the output from the code of 
the computation. 

14.7. (1) To define CoDEfc, consider what r(l,0,01"iO. . .01"'=)^ 
is as a prime power expansion, and arrange a suitable compo- 
sition to obrtain it from (ni, . . . , n^). 

(2) To define DECODE you only need to count how many pow- 
ers of primes other than 3 in the prime-power expansion of 
'"(s, i, 01""'"^)"' are equal to 1. 

14.8. Use Proposition 14.6 and Lemma 14.7. 

14.9. This follows directly from Theorems 13.14 and 14.8. 

14.10. Take some creative inspiration from Definitions 14.1 and 
14.2. For example, if (s, i) e dom(M) and M(s, i) — (j, d, t), you could 
let the code of M(s, i) be 

^M(s,i)^ = 2^3^5^' 7*^+^11*. 

14.11. Much of what you need for both parts is just what was 
needed for Problem 14.5, except that Step is probably easier to define 
than StePm was. (Define it as a composition. . . ) The additional 
ingredients mainly have to do with using m = '"M"' properly. 

14.12. Essentially, this is to Problem 14.11 as proving Proposition 
14.6 is to Problem 14.5. 

14.13. The machine that computes SIM does the job. 
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14.14. A modification of SIM does the job. The modifications are 
needed to handle appropriate input and output. Check Theorem 13.15 
for some ideas on what may be appropriate. 

14.15. This can be done directly, but may be easier to think of in 
terms of recursive functions. 

14.16. Suppose the answer was yes and such a machine T did exist. 
Create a machine U as follows. Give T the machine C from Problem 
14. 15 as a pre-processor and alter its behaviour by having it run forever 
if M halts and halt if M runs forever. What will T do when it gets 
itself as input? 

14.17. Use Xp to help define a function / such that im(/) — P. 

14.18. One direction is an easy application of Proposition 14.17. 
For the other, given an n G N, run the functions enumerating P and 
N \ P concurrently until one or the other outputs n. 

14.19. Consider the set of natural numbers coding (according to 
some scheme you must devise) Turing machines together with input 
tapes on which they halt. 

14.20. See how far you can adapt your argument for Proposition 
14.18. 

14.21. This may well be easier to think of in terms of Turing ma- 
chines. Run a Turing machine that computes g for a few steps on the 
first possible input, a few on the second, a few more on the first, a few 
more on the second, a few on the third, a few more on the first, . . . 



Part IV 
Incompleteness 



CHAPTER 15 



Preliminaries 

It was mentioned in the Introduction that one of the motivations for 
the development of notions of computability was the following question. 

Entscheidungsproblem. Given a reasonable set E of formulas 
of a first-order language C and a formula (/? of >C, is there an effective 
method for determining whether or not S h □ 

Armed with knowledge of first-order logic on the one hand and 

of computability on the other, we are in a position to formulate this 
question precisely and then solve it. To cut to the chase, the answer is 
usually "no". Godel's Incompleteness Theorem asserts, roughly, that 
given any set of axioms in a first-order language which are computable 
and also powerful enough to prove certain facts about arithmetic, it 
is possible to formulate statements in the language whose truth is not 
decided by the axioms. In particular, it turns out that no consistent 
set of axioms can hope to prove its own consistency. 

We will tackle the Incompleteness Theorem in three stages. First, 
we will code the formulas and proofs of a first-order language as num- 
bers and show that the functions and relations involved are recursive. 
This will, in particular, make it possible for us to define a "computable 
set of axioms" precisely. Second, we will show that all recursive func- 
tions and relations can be defined by first-order formulas in the presence 
of a fairly minimal set of axioms about elementary number theory. Fi- 
nally, by putting recursive functions talking about first-order formulas 
together with first-order formulas defining recursive functions, we will 
manufacture a self-referential sentence which asserts its own unprov- 
ability. 

Note. It will be assumed in what follows that you are familiar with 

the basics of the syntax and semantics of first-order languages, as laid 
out in Chapters 5-8 of this text. Even if you are already familiar with 
the material, you may wish to look over Chapters 5-8 to familiarize 
yourself with the notation, definitions, and conventions used here, or 
at least keep them handy in case you need to check some such point. 
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15. PRELIMINARIES 



A language for first-order number theory. To keep things as 
concrete as possible we will work with and in the following language 
for first-order number theory, mentioned in Example 5.2. 

Definition 15.1. jCn is the first-order language with the following 

symbols: 

(1) Parentheses: ( and ) 

(2) Connectives: -i and 

(3) Quantifier: V 

(4) Equality: = 

(5) Variable symbols: Vq, V2, v^, . . . 

(6) Constant symbol: 

(7) 1-place function symbol: S 

(8) 2-place function symbols: -|-, •, and E. 

The non-logical symbols of £jv, 0, S, +, ■, and E, are intended 
to name, respectively, the number zero, and the successor, addition, 
multiplication, and exponentiation functions on the natural numbers. 
That is, the (standard!) structure this language is intended to discuss 
is 91= (N,0,S,+,-,E). 

Completeness. The notion of completeness used in the Incom- 
pleteness Theorem is difi^erent from the one used in the Completeness 
Theorem.^ "Completeness" in the latter sense is a property of a logic: 
it asserts that whenever F |= cr {i.e. the truth of the sentence a follows 
from that of the set of sentences T), T \- a (i.e. there is a deduction of a 
from r). The sense of "completeness" in the Incompleteness Theorem, 
defined below, is a property of a set of sentences. 

Definition 15.2. A set of sentences S of a first-order language £ 
is said to be complete if for every sentence r either E h r or E I — it. 

That is, a set of sentences, or non-logical axioms, is complete if it 
suffices to prove or disprove every sentence of the langage in in question. 

Proposition 15.1. A consistent set Ti of sentences of a first-order 
language C is complete if and only if the theory of E, 

Th(E) — {t \ T is a sentence of C and E h r } , 

is maximally consistent. 



Which, to confuse the issue, was also first proved by Kurt Godel. 
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Coding First-Order Logic 

We will encode the symbols, formulas, and deductions of Cn as 
natural numbers in such a way that the operations necessary to ma- 
nipulate these codes are recursive. Although we will do so just for Cn, 
any countable first-order language can be coded in a similar way. 

Godel coding. The basic approach of the coding scheme we will 
use was devised by Godel in the course of his proof of the Incomplete- 
ness Theorem. 

Definition 16.1. To each symbol s oi Cn we assign an unique 
positive integer '"s"', the Godel code of s, as follows: 

(1) = 1 and = 2 

(2) ^ 3 and = 4 

(3) = 5 

(4) r=n = Q 

(5) ^vP = k + 12 

(6) = 7 

(7) = 8 

(8) r+n ^ 9^ r.n ^ ^q, and ^E^ = 11 

Note that each positive integer is the Godel code of one and only one 

symbol of Cn- We will also need to code sequences of the symbols of 
Cn, such as terms and formulas, as numbers, not to mention sequences 
of sequences of symbols of i2Ar, such as deductions. 

Definition 16.2. Suppose siS2 ... is a sequence of symbols of 
Cn- Then the Godel code of this sequence is 

where p„ is the nth prime number. 

Similarly, if o"iO"2 . . . o-£ is a sequence of sequences of symbols of Cn, 
then the Godel code of this sequence is 
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Example 16.1. The code of the formula Vwi = -viSOvi (the official 
form of V^;i vi ■ SO — vi), '~\/vi — ■viSOvi~', works out to 

= 109425289274918632559342112641443058962750733001979829025245569500000. 

This is not the most efficient conceivable coding scheme! 

Example 16.2. The code of the sequence of formulas 

= 00 i.e. = 

(=00^=5050) I.e. = 0^ SO = SO 

^SOSO i.e. SO^SO 

works out to 

2'" =00"' 3'" (=00-^=5050) 5'"=5050"' 
^ 22'^'3"0"5"0" 

_ 32'"("'3'"="'5'"0"'7'"0"'ir^"'l3'"="'l7'"«"'l9'"0"'23'"5'-'29'"0"'3r)"' 

_ g2"="3"«"5"0"7"«"iro" 
_ 22«3''5''32i365''7''ll'*136l7«19''23«29''3l2g263«5''7«ll'' 

which is large enough not to be worth the bother of working it out 
explicitly. 

Problem 16.1. Pick a short sequence of short formulas of Cn and 
find the code of the sequence. 

A particular integer n may simultaneously be the Godel code of a 
symbol, a sequence of symbols, and a sequence of sequences of symbols 
of Cn- We shall rely on context to avoid confusion, but, with some 
more work, one could set things up so that no integer was the code of 
more than one kind of thing. In any case, we will be most interested 
in the cases where sequences of symbols are (official) terms or formulas 
and where sequences of sequences of symbols are sequences of (official) 
formulas. In these cases things are a little simpler. 

Problem 16.2. Is there a natural number n which is simultaneously 
the code of a symbol of Cn, the code of a formula of Cn, and the code 
of a sequence of formulas ofCN? If not, how many of these three things 
can a natural number be? 

Recursive operations on Godel codes. We will need to know 
that various relations and functions which recognize and manipulate 
Godel codes are recursive, and hence computable. 
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Problem 16.3. Show that each of the following relations is primi- 
tive recursive. 

(1) TERM(n) <^=^ n — '"i"' for some term t of jCn- 

(2) FoRMULA(n) <^=^ n = '"(/?"' for some formula ip of Cn- 

(3) Sentence (n) <^=^ n = '"cr"' for some sentence a of Cn. 

(4) LOGlCAL(n) <^=^ n = '"7"' for some logical axiom 7 of Cn- 

Using these relations as building blocks, we will develop relations 
and functions to handle deductions of Cn- First, though, we need to 
make "a computable set of formulas" precise. 

Definition 16.3. A set A of formulas of vCat is said to be recursive 
if the set of Godel codes of formulas of A, 

r A^ = { I 5 e A } , 

is a recursive subset of N (ie. a recursive 1-place relation). Similarly, 
A is said to be recursively enumerable if '"A"' is recursively enumerable. 

Problem 16.4. Suppose A is a recursive set of sentences of Cn- 
Show that each of the following relations is recursive. 

(1) PremisSa(^) <^=^ n = '"/?"' for some formula j3 of Cn which 
is either a logical axiom or in A. 

(2) FORMULAS(n) <^=^ n — '~(fi . . .ipk~^ for some sequence 
(pi . . .(pk of formulas of Cn- 

(3) lNFERENCE(n, j) <^=^ u = ^ifi... ipk^ for some sequence 
ipi . . .ipk of formulas of Cn , ^ <ii3 ^k, and ipk follows from 
ipi and ipj by Modus Ponens. 

(4) Deduction A (tt-) <^=^ n — '~(fi . . . (pk'^ for a deduction (pi . . .(pk 
from A in Cn- 

(5) CoNCLUSlONA(n, m) <^=^ n = '~(pi . . .(pk~^ for a deduction 

ipi . . .ipk from A in Cn and m = ^ipk^. 
IfA'^ is primitive recursive, which of these are primitive recursive? 

It is at this point that the connection between computability and 
completeness begins to appear. 

Theorem 16.5. Suppose A is a recursive set of sentences of Cn- 
Then rTh(A)^ is 

(1) recursively enumerable, and 

(2) recursive if and only if A is complete. 

Note. It follows that if A is not complete, then '"Th(A)"' is an 
example of a recursively enumerable but not recursive set. 
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Defining Recursive Functions In Arithmetic 

The definitions and results in Chapter 17 let us use natural numbers 
and recursive functions to code and manipulate formulas of Cn- We 
will also need complementary results that let us use terms and formu- 
las of £jv to represent and manipulate natural numbers and recursive 
functions. 

Axioms for bcisic cirithmetic. Wc will define a set of non-logical 

axioms in Cn which prove enough about the operations of successor, 
addition, mutliplication, and exponentiation to let us define all the 
recursive functions using formulas of >Cjv. The non-logical axioms in 
question essentially guarantee that basic arithmetic works properly. 

Definition 17.1. Let A be the following set of sentences of jCn, 
written out in official form. 

Nl: Vuo {^ = SvoO) 

N2: Wvo ((- = voO) ^ {^^vi = SviVo))) 

N3: WvqWvi (= SvqSvi — vqVi) 

N4: V^;o = +Vo0vo 

N5: \/vo\fvi — +voSviS + vqVi 

N6: ^vo = -voOO 

N7: WvqWvi = -vqSvi + -vqViVq 

N8: V-Uo = EvoOSO 

N9: VvoVvi = EvqSvi ■ EvqViVo 

Translated from the official forms, A consists of the following ax- 
ioms about the natural numbers: 



Nl: 


For 


all 


n, 


n + 1 ^ 0. 


N2: 


For 


all 


n. 


, n 7^ there is a A; such that k + 1 = n. 


N3: 


For 


all 


n 


and k,n+\ = k + l implies that n = k 


N4: 


For 


all 


n, 


n + — n. 


N5: 


For 


all 


n 


and k, n + (k + 1) ^ (n + k) + 1. 


N6: 


For 


all 


n. 


n-0 = 0. 


N7: 


For 


all 


n 


and k, n ■ {k + 1) = {n ■ k) + n. 


N8: 


For 


all 


n. 


, n° = 1. 


N9: 


For 


all 


n 


and k, n'^'^^ — {n^) ■ n. 
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17. DEFINING RECURSIVE FUNCTIONS IN ARITHMETIC 



Each of the axioms in A is true of the natural numbers: 

PROPOSITION 17.1. m ^ A, where ^ = (N, 0, S, +, •, E) is the 
structure consisting of the natural numbers with the usual zero and 
the usual successor, addition, multiplication, and exponentiation oper- 
ations. 

However, ^ is a long way from being able to prove all the sentences 
of first-order arithmetic true in DT. For example, though we won't prove 
it, it turns out that A is not enough to ensure that induction works: 
that for every formula (p with at most the variable x free, if ip^ and 

if I ^ Vsy) hold, then so does Vx On the other hand, neither Cn 
nor A are quite as minimal as they might be. For example, with some 
(considerable) extra effort one could do without E and define it from • 
and +. 

Representing functions and relations. For convenience, we 
will adopt the following conventions. First, we will often abbreviate 
the term ol Cn consisting oi m Ss followed by by S"^0. For example, 
S^O abbreviates SSSO. The term S"^0 is a convenient name for the 
natural number m in the language Cn since the interpretation of S"^0 
in 91 is m: 

Lemma 17.2. For every m e N and every assignment s for DT, 
s(5™0) = m. 

Second, if (/? is a formula of with all of its free variables among 
vi, . . . , Vk, and mo, mi, . . . , ruk are natural numbers, we will write 
9?(5™i0, . . . , ^'"'=0) for the sentence <^5mio':...,s-feO' ^-e- <^ with S'^'O sub- 
stituted for every free occurrence of Vi. Since the term ^'"^^O involves 
no variables, it is substitutable for Vi in (f. 

Definition 17.2. Suppose E is a set of sentences of Cn- A A;-place 
function / is said to be representable in Th(S) = {r|SI-r}if there 
is a formula ip of Cn with at most vi, . . . ,Vk, and Vk+i as free variables 
such that 

/(m, . . . , rife) = m ^ ifiiS'^^ S^'O, S'^O) e Th(S) 

for all rii, . . . , rik, and m in N. The formula (f is said to represent f in 
Th(E). 

We will use this definition mainly with — A. 

Example 17.1. The constant function Cg given by cl{n) = 3 is 
representable in Th(^); V2 — S^O is a formula representing it. Note 



17. DEFINING RECURSIVE FUNCTIONS IN ARITHMETIC 119 

that that this formula has no free variable for the input of the 1-place 
function, but then the input is irrelevant. . . 

To see that V2 — S^O really does represent Cg in Th(^) , we need to 
verify that 

for all n,m eN. 

In one direction, suppose that cl{n) — m. Then, by the definition 
of C3, we must have m = 3. Now 

{l)'ixx = x ^ S^Q^S^Q A4 

(2) Mxx = x A8 

(3) S^Q = S^Q 1,2 MP 

is a deduction of 5'^0 — S^Q from A. Hence if c\{n) — m, then A h 

In the other direction, suppose that A h S'™'0 = 5*^0. Since |= A, 
it follows that 91 |= ^'"0 = 5^0. It follows from Lemma 17.2 that 
m = 3, so c\{n) = m. Hence if ^ h 5'"^0 = 5*^0, then c\{n) = m. 

Problem 17.3. Show that the projection function 7r| can be repre- 
sented in Th(^). 

Definition 17.3. A /c-place relation P C N'^ is said to be repre- 
sentable in Th(S) if there is a formula ip of Cn with at most vi, . . . , 
Vk as free variables such that 

P(ni,...,nfe) ^ V('5"^0,...,,S"'=0) eTh(E) 

<^ Eh V('5"iO,...,,5"'=0) 

for all ni, . . . , in N. The formula ip is said to represent P in Th(S). 

We will also use this definition mainly with = A. 

Example 17.2. Almost the same formula, Vi = S^O, serves to 
represent the set — i.e. 1-place relation — {3} in Th(^). Showing 
that vi = S^O really does represent {3} in Th(^) is virtually identical 
to the corresponding argument in Example 17.1. 

Problem 17.4. Explain why V2 = SSSO does not represent the set 
{3} in Th(^) and vi — SSSO does not represent the constant function 
cl in Th{A). 

Problem 17.5. Show that the set of all even numbers can repre- 
sentable in Th(^) . 
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Problem 17.6. Show that the initial functions are representable in 
Th{A): 

(1) The zero function 0{n) = 0. 

(2) The successor function S(n) = n + 1. 

(3) For every positive k and i < k, the projection function tt'^. 

It turns out that all recursive functions and relations are repre- 
sentable in Th(^). 

Proposition 17.7. A k -place function f is representable in Tli(^) 
if and only if the k + 1-place relation Pf defined by 

Pfijii, . . . ,nk,nk+i) f{ni,...,nk) =nk+i 

is representable in Th(^). 

Also, a relation P C.N'^ is representable in Th(^) if and only if its 
characteristic function xp is representable in Tli(^) . 

Proposition 17.8. Suppose g\, . . . , gm are k-place functions and 

h is an m-place function, all of them representable in Th(^). Then 
f = h o [gi, . . . , g^n) is a k-place function representable in Th(^). 

Proposition 17.9. Suppose g is a k + 1-place regular function 
which is representable in Tli(^). Then the unbounded minimalization 
of g is a k-place function representable in Th(^). 

Between them, the above results supply most of what is needed 
to conclude that all recursive functions and relations on the natural 
numbers are representable. The exception is showing that functions 
defined by primitive recursion from representable functions are also 
representable, which requires some additional effort. The basic problem 
is that it is not obvious how a formula defining a function can get at 
previous values of the function. To accomplish this, we will borrow a 
trick from Chapter 13. 

Problem 17.10. Show that each of the following relations and func- 
tions (first defined in Problem 13.5) is representable in Th(^). 

(1) Div(n,m) <^=^ n \ m 

(2) IsPRlME(n) <^=^ n is prime 

(3) Prime(A;) = pk, where po — 1 andpk is the kth prime ifk > 1. 

(4) PowER(n, m) = k, where k >0 is maximal such that \ m. 

(5) LENGTH(n) = i, where i is maximal such that pi \ n. 

(6) ELEMENT(n, i) = Ui, where n = (and Ui = if 
i> k). 

Using the representable functions and relations given above, we can 
represent a "history function" of any representable function. . . 
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Problem 17.11. Suppose f is a k-place function representable in 
Th{A). Show that 

F{ni, ...,nk,m)= ■'^) . . . p/K-..,n„m) 



J{ni,...,nk,i) 
i 

i=0 



is also representable in Th(^). 
. . . and use it! 

Proposition 17.12. Suppose g is a k-place function and h is a 
k + 2-place function, both representable in Th(^). Then the k + 1- 
place function f defined by primitive recursion from g and h is also 
representable in Th(^). 

Theorem 17.13. Recursive functions are representable in Th(^). 

In particular, it follows that there are formulas of Cn represent- 
ing each of the functions from Chapter 16 for manipulating the codes 
of formulas. This will permit us to construct formulas which encode 

assertions about terms, formulas, and deductions; we will ultimately 
prove the Incompleteness Theorem by showing there is a formula which 
codes its own unprovability. 

Representability. We conclude with some more general facts about 
representability. 

Proposition 17.14. Suppose J2 is a set of sentences of Cn and f 
is a k-place function which is representable in Th(S). Then E must be 
consistent. 

Problem 17.15. //E is a set of sentences of Cn and P is a k-place 
relation which is representable in Th(E), does E have to be consistent? 

Proposition 17.16. Suppose E and F are consistent sets of sen- 
tences of Cn (ind S h F, i.e. E h 7 for every 7 G F. Then every 
function and relation which is representable in Th(F) is representable 
in Th(S). 

This lets us use everything we can do with representability in Th(^) 
with any set of axioms in Cn that is at least as powerful as A. 

Corollary 17.17. Functions and relations which representable in 
Th(^) are also representable in Th(S), for any consistent set of sen- 
tences E such that E h ^. 
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The Incompleteness Theorem 

The material in Chapter 16 effectively allows us to use recursive 
functions to manipulate coded formulas of ^at, while the material in 
Chapter 17 allows us to represent recursive functions using formulas 
Cn- Combining these techniques allows us to use formulas of Cn to 
refer to and manipulate codes of formulas of ^at. This is the key to 
proving Godel's Incompleteness Theorem and related results. 

In particular, we will need to know one further trick about manip- 
ulating the codes of formulas recursively, that the operation of substi- 
tuting (the code of) the term S^Q into (the code of) a formula with one 
free variable is recursive. 

Problem 18.1. Show that the function 

{'"(/?(S''^0)"' if n — '"</?"' for a formula (/? of Cn 
with at most vi free 
otherwise 

is recursive, and hence representable Th(^). 

In order to combine the the results from Chapter 16 with those 
from Chapter 17, we will also need to know the following. 

Lemma 18.2. A is a recursive set of sentences of Cn- 

The First Incompleteness Theorem. The key result needed to 
prove the First Incompleteness Theorem (another will follow shortly!) 
is the following lemma. It asserts, in effect, that for any statement 
about (the code of) some sentence, there is a sentence a which is true 
or false exactly when the statement is true or flase of (the code of) a. 
This fact will allow us to show that the self-referential sentence we will 
need to verify the Incompleteness theorem exists. 

Lemma 18.3 (Fixed-Point Lemma). Suppose (p is a formula of Cn 
with only vi as a free variable. Then there is a sentence a of Cn such 
that 

A^a^ (/?(,S'^'0) . 
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Note that a must be different from tlie sentence 99(6''' '^"'O): tliere is 
no way to find a formula witfi one free variable and an integer k such 
that '"(/9(S''^0)~' = k. (Think about how Godel codes are defined. . . ) 

With the Fixed-Point Lemma in hand, Godel's First Incompleteness 
Theorem can be put away in fairly short order. 

Theorem 18.4 (Godel's First Incompleteness Theorem). Suppose 
is a consistent recursive set of sentences of Cn such that T, \- A. 
Then E is not complete. 

That is, any consistent set of sentences which proves at least as 
much about the natural numbers as A does can't be both complete 

and recursive. The First Incompleteness Theorem has many variations, 
corollaries, and relatives, a few of which will be mentioned below. [17] 
is a good place to learn about more of them. 

Corollary 18.5. (1) Let T be a complete set of sentences of 
Cn such that V\J A is consistent. Then F is not recursive. 

(2) Let /S. he a recursive set of sentences such that /S.\J A is con- 
sistent. Then A is not complete. 

(3) The theory o/D^I, 

Th(Ol) — {a \ a is a sentence of Cn and a} , 
is not recursive. 

There is nothing really special about working in Cn- The proof 
of G5del's Incompleteness Theorem can be executed for any first order 
language and recursive set of axioms which allow one to code and prove 
enough facts about arithmetic. In particular, it can be done whenever 
the language and axioms arc powerful enough — as in Zermelo-Praenkel 
set theory, for example — to define the natural numbers and prove some 
modest facts about them. 

The Second Incompleteness Theorem. Godel also proved a 
strengthened version of the Incompleteness Theorem which asserts that, 
in particular, a consistent recursive set of sentences E of >Cjv cannot 
prove its own consistency. To get at it, we need to express the state- 
ment "E is consistent" in Cn- 

Problem 18.6. Suppose S is a recursive set of sentences of Cn- 
Find a sentence of Cn, which we'll denote by Con(E), such that E is 
consistent if and only if A\- Gon(E). 

Theorem 18.7 (Godel's Second Incompleteness Theorem). Let S 
be a consistent recursive set of sentences of Cn such that S h Then 
EF Con(E). 
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As with the First Incompleteness Theorem, the Second Incomplete- 
ness Theorem holds for any recursive set of sentences in a first-order 
language which allow one to code and prove enough facts about arith- 
metic. The perverse consequence of the Second Incompleteness Theo- 
rem is that only an inconsistent set of axioms can prove its own con- 
sistency. 

Truth and definability. A close relative of the Incompleteness 
Theorem is the assertion that truth in 91 = (N, S, -|-, •, E, 0) is not 
definable in Dt. To make sense of this, of course, we need to sort out 
what "truth" and "definable in DT" mean here. 

"Truth" means what it usually does in first-order logic: all we mean 
when we say that a sentence a oi £n is true in 01 is that when a is 
true when interpreted as a statement about the natural numbers with 
the usual operations. That is, a is true in 01 exactly when 01 satisfies 
a, i.e. exactly when 01 |= a. 

"Definable in 01" wc do have to define. . . 

Definition 18.1. A fc-place relation is definable in 01 if there is a 
formula oi Cn with at most vi, . . . , as free variables such that 



for every assignment s of 01. The formula Lp is said to define P in 01. 

A definition of "function definable in 01" could be made in a similar 
way, of course. Definability is a close relative of representability: 

Proposition 18.8. Suppose P is a k-place relation which is rep- 
resentable in Th(^) . Then P is definable in 01. 

Problem 18.9. Is the converse to Proposition 18.8 true? 

The question of whether truth in 01 is definable is then the question 
of whether the set of Godel codes of sentences ol Cn true in 01, 

'"Th(01)~' = { '"cr"' I cr is a sentence ol Cn and 01 |= a } , 

is definable in 01. It isn't: 

Theorem 18.10 (Tarski's Undefinability Theorem). '~Th(01)~' is 
not definable in 01. 

The impUcations. Godel's Incompleteness Theorems have some 
serious consequences. 

Since almost all of mathematics can be formalized in first-order 
logic, the First Incompleteness Theorem implies that there is no effec- 
tive procedure that will find and prove all theorems. This might be 
considered as job security for research mathematicians. 




126 



18. THE INCOMPLETENESS THEOREM 



The Second Incompleteness Theorem, on the other hand, imphes 
that we can never be completely sure that any reasonable set of axioms 
is actually consistent unless we take a more powerful set of axioms on 
faith. It follows that one can never be completely sure — faith aside — 
that the theorems proved in mathematics are really true. This might 
be considered as job security for philosophers of mathematics. 

We leave the question of who gets job security from Tarski's Unde- 
finability Theorem to you, gentle reader. . . 



Hints for Chapters 15—18 



Hints for Chapter 15. 

15.1. Compare Definition 15.2 with the definition of maximal con- 
sistency. 

Hints for Chapter 16. 

16.1. Do what is done in Example 16.2 for some other sequence of 
formulas. 

16.2. You need to unwind Definitions 16.1 and 16.2, keeping in 
mind that you are dealing with formulas and sequences of formulas, 
not just arbitrary sequences of symbolsof or sequences of sequences 
of symbols. 

16.3. In each case, use Definitions 16.1 and 16.2, along with the 
appropriate definitions from first-order logic and the tools developed 
in Problems 13.3 and 13.5. 

(1) Recall that in Cn-, a term is either a variable symbol, i.e. Vk 
for some k, the constant symbol 0, of the form St for some 
(shorter) term t, or +^1^2 for some (shorter) terms ti and ^2- 
Xterm(^) needs to check the length of the sequence coded by n. 
If this is of length 1, it will need to check if the symbol coded is 
or Vk for some k; otherwise, it needs to check if the sequence 
coded by n begins with an S or +, and then whether the rest 
of the sequence consists of one or two valid terms. Primitive 
recursion is likely to be necessary in the latter case if you can't 
figure out how to do it using the tools from Problems 13.3 and 
13.5. 

(2) This is similar to showing TERM(n) is primitive recursive. Re- 
call that in Cisr, a formula is of the form either — tit^ for some 
terms ti and t2, (A) for some (shorter) formula a, {(5 — > 7) 
for some (shorter) formulas j3 and 7, or V?;i 6 for some variable 
symbol Vi and some (shorter) formula 5. Xformula(^) needs to 
check the first symbol of the sequence coded by n to identify 
which case ought to apply and then take it from there. 
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(3) Recall that a sentence is justa formula with no free variable; 
that is, every occurrence of a variable is in the scope of a 
quantifier. 

(4) Each logical axiom is an instance of one of the schema A1-A8, 
or is a generalization thereof. 

16.4. In each case, use Definitions 16.1 and 16.2, together with the 
appropriate definitions from first-order logic and the tools developed 
in Problems 13.5 and 16.3. 

(1) '"A"' is recursive and LOGICAL is primitive recursive, so. . . 

(2) All Xformulas(?T') has to do is check that every element of the 

sequence coded by n is the code of a formula, and FORMULA 
is already known to be primitive recursive. 

(3) XinferenceI'T') needs to check that n is the code of a sequence of 
formulas, with the additional property that either (pi is {(fj — > 
(pk) or ipj is {ipi ipk)- Part of what goes into XformulaI^i) 
may be handy for checking the additional property. 

(4) Recall that a deduction from A is a sequence of formulas 
ipi . . .ipk where each formula is either a premiss or follows from 
preceding formulas by Modus Ponens. 

(5) XconclusionI'^, ''^) needs to check that n is the code of a de- 
duction and that m is the code of the last formula in that 
deduction. 

They're all primitive recursive if '"A"' is, by the way. 

16.5. (1) Use unbounded minimalization and the relations in 
Problem 16.4 to define a function which, given n, returns the 
nth integer which codes an element of Th(A). 

(2) If A is complete, then for any sentence a, either [a] or \-ia 
must eventually turn up in an enumeration of '~Th(A)~'. The 
other direction is really just a matter of unwinding the defini- 
tions involved. 

Hints for Chapter 17. 

17.16. Every deduction from F can be replaced by a deduction of 
E with the same conclusion. 

17.14. If E were insconsistent it would prove entirely too much. . . 

17.6. (1) Adapt Example 17.1. 

(2) Use the 1-place function symbol S oi Cn- 

(3) There is much less to this part than meets the eye. . . 
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17.7. In each case, you need to use the given representing formula 
to define the one you need. 

17.8. String together the formulas representing gi, . . . , Qm, and h 
with As and put some existential quantifiers in front. 

17.9. First show that that < is representable in Th(^) and then 
exploit this fact. 

17.10. (1) n I m if and only if there is some k such that n-k — 
m. 

(2) n is prime if and only if there is no £ such that i \ n and 

I < a <n. 

(3) pk is the first prime with exactly k — 1 primes less than it. 

(4) Note that k must be minimal such that v}''^^ \ m. 

(5) You'll need a couple of the previous parts. 

(6) Ditto. 

17.11. Problem 17.10 has most of the necessary ingredients needed 
here. 

17.12. Problems 17.10 and 17.11 have most of the necessary ingre- 
dients between them. 

17.13. Proceed by induction on the numbers of applications of com- 
position, primitive recursion, and unbounded minimalization in the re- 
cursive definition /, using the previous results in Chapter 17 at the 
basis and induction steps. 

Hints for Chapter 18. 

18.2. ^ is a finite set of sentences. 

18.1. First show that recognizing that a formula has at most vi as 
a free variable is recursive. The rest boils down to checking that sub- 
stituting a term for a free variable is also recursive, which has already 
had to be done in the solutions to Problem 16.3. 

18.3. Let ip be the formula (with at most Vi, f 2, and ^3 free) which 
represents the function / of Problem 18.1 in Th(^). Then the formula 
Vw3 {ip'"'^vi ifl'!^) has only one variable free, namely vi, and is very 
close to being the sentence a needed. To obtain a you need to substitute 
S'^O for a suitable k for vi. 

18.4. Try to prove this by contradiction. Observe first that if E is 
recursive, then '"Th(E)"' is representable in Th(^). 
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18.5. (1) If r were recursive, you could get a contradiction to 
the Incompleteness Theorem. 

(2) If A were complete, it couldn't also be recursive. 

(3) Note that A C Th(Dfl). 

18.6. Modify the formula representing the function CONCLUSIONS 
(defined in Problem 16.4) to get Con(E). 

18.7. Try to do a proof by contradiction in three stages. First, 
find a formula (p (with just vi free) that represents "n is the code of 
a sentence which cannot be proven from E" and use the Fixed-Point 
Lemma to find a sentence r such that S h r Lp{S^'^^). Second, show 
that if S is consistent, then S F r. Third — the hard part — show 
that S h Con(S) ip{S'~'^^). This leads directly to a contradiction. 

18.8. Note that m^A. 

18.9. If the converse was true, A would run afoul of the (First) 
Incompleteness Theorem. 

18.10. Suppose, by way of contradiction, that '~Th(Dft)~^ was defin- 
able in DT. Now follow the proof of the (First) Incompleteness Theorem 
as closely as you can. 
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APPENDIX A 



A Little Set Theory 

This apppendix is meant to provide an informal summary of the 
notation, definitions, and facts about sets needed in Chapters 1-9. For 
a proper introduction to elementary set theory, try [8] or [10]. 

Definition A.l. Suppose X and Y are sets. Then 

(1) a & X means that a is an element of {i.e. a thing in) the set 
X. 

(2) X is a subset of Y, written as X C F, if a e F for every 
a e X. 

(3) The union of X and Y is X UY = {a \ a e X or a e Y }. 

(4) The intersection of X and Y is X r\Y — {a \ a E X and a e 
Y}. 

(5) The complement of Y relative to X is X\Y = {a \ ae 
X and a ^ Y}. 

(6) The cross product of X and Y is X x Y — {{a,b) \ a E 
X and 6 e y }. 

(7) The power set of X is V{X) = {Z \ Z C X}. 

(8) [X]'' = { Z \ Z C X and \Z\ = k} is the set of subsets of X of 
size k. 

If all the sets being dealt with are all subsets of some fixed set Z, 
the complement of Y, Y, is usually taken to mean the complement 

of Y relative to Z. It may sometimes be necessary to take unions, 
intersections, and cross products of more than two sets. 

Definition A. 2. Suppose A is a set and X = {Xa|ae^}isa 
family of sets indexed by A. Then 

(1) The union of X is the set [jX = { z \ 3a e A: z e Xa}. 

(2) The intersection of X is the set f^X ^ {z {^a e A: z e Xa}. 

(3) The cross product of X is the set of sequences (indexed by A) 

nX = Ua&A^a = {{Za \ a e A) \ \/a e A: Za e Xa}. 

We will denote the cross product of a set X with itself taken n times 
{i.e. the set of all sequences of length n of elements of X) by X". 

Definition A. 3. If X is any set, a k-place relation on X is a. subset 
R C X^ 
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For example, the set E' = { 0, 2, 3, . . . } of even natural numbers is 
a 1-place relation on N, = { (x, e | x divides y } is a 2-place 
relation on N, and 5" = { (a, fo, c) e | a + 6 = c } is a 3-place relation 
on N. 2-place relations are usually called binary relations. 

Definition A. 4. A set X is finite if there is some G N such 
that X has n elements, and is infinite otherwise. X is countable if it is 
infinite and there is a 1-1 onto function / : N — >• X, and uncountable if 
it is infinite but not countable. 

Various infinite sets occur frequently in mathematics, such as N (the 
natural numbers) , Q (the rational numbers) , and R (the real numbers) . 
Many of these are uncountable, such as M. The basic facts about 
countable sets needed to do the problems are the following. 

Proposition A.l. (1) If X is a countable set and Y <Z X, 
then Y is either finite or a countable. 

(2) Suppose X = { Xn \ n E N} is a finite or countable family of 
sets such that each Xn is either finite or countable. Then (J X 
is also finite or countable. 

(3) If X is a non-empty finite or countable set, then X^ is finite 
or countable for each n>l. 

(4) If X is a non-empty finite or countable set, then the set of all 
finite sequences of elements of X, X^'^ — IJneN count- 
able. 

The properly sceptical reader will note that setting up propositional 
or first-order logic formally requires that we have some set theory in 
hand, but formalizing set theory itself requires one to have first-order 
logic. ^ 



Which came first, the chicken or the egg? Since, bibHcally speaking, "In the 
beginning was the Word" , maybe we ought to plump for alphabetical order. Which 
begs the question: In which alphabet? 
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The Greek Alphabet 
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Logic Limericks 



Deduction Theorem 

A Theorem fine is Deduction, 
For it allows work-reduction: 
To show "A implies B" , 
Assume A and prove B; 
Quite often a simpler production. 

Generalization Theorem 

When in premiss the variable's bound, 
To get a "for all" without wound. 
Generalization. 
For civilization 

Could use some help for reasoning sound. 

Soundness Theorem 

It's a critical logical creed: 

Always check that it's safe to proceed. 

To tell us deductions 

Are truthful productions, 

It's the Soundness of logic we need. 

Completeness Theorem 

The Completeness of logics is Godel's. 
'Tis advice for looking for models: 
They're always existent 
For statements consistent. 
Most helpful for logical labors. 
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GNU Free Documentation License 

Version 1.2, November 2002 

Copyright ©2000,2001,2002 Free Software Foundation, Inc. 
59 Temple Place, Suite 330, Boston, MA 02111-1307 USA 
Everyone is permitted to copy and distribute verbatim copies of 
this license document, but changing it is not allowed. 

0. PREAMBLE 

The purpose of this License is to make a manual, textbook, or 
other functional and useful document "free" in the sense of freedom: 
to assure everyone the effective freedom to copy and redistribute it, 
with or without modifying it, cither commercially or noncommercially. 
Secondarily, this License preserves for the author and publisher a way 
to get credit for their work, while not being considered responsible for 
modifications made by others. 

This License is a kind of "copyleft", which means that derivative 
works of the document must themselves be free in the same sense. 
It complements the GNU General Public License, which is a copyleft 
hcense designed for free software. 

We have designed this License in order to use it for manuals for 
free software, because free software needs free documentation: a free 
program should come with manuals providing the same freedoms that 
the software does. But this License is not limited to software manuals; 
it can be used for any textual work, regardless of subject matter or 
whether it is published as a printed book. We recommend this License 
principally for works whose purpose is instruction or reference. 

1. APPLICABILITY AND DEFINITIONS 

This License applies to any manual or other work, in any medium, 
that contains a notice placed by the copyright holder saying it can 
be distributed under the terms of this License. Such a notice grants a 
world-wide, royalty-free license, unlimited in duration, to use that work 
under the conditions stated herein. The "Document", below, refers to 
any such manual or work. Any member of the public is a licensee, and 
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is addressed as "you". You accept the license if you copy, modify or 
distribute the work in a way requiring permission under copyright law. 

A "Modified Version" of the Document means any work contain- 
ing the Document or a portion of it, either copied verbatim, or with 
modifications and/or translated into another language. 

A "Secondary Section" is a named appendix or a front-matter sec- 
tion of the Document that deals exclusively with the relationship of 
the publishers or authors of the Document to the Document's overall 
subject (or to related matters) and contains nothing that could fall 
directly within that overall subject. (Thus, if the Document is in part 
a textbook of mathematics, a Secondary Section may not explain any 
mathematics.) The relationship could be a matter of historical connec- 
tion with the subject or with related matters, or of legal, commercial, 
philosophical, ethical or political position regarding them. 

The "Invariant Sections" are certain Secondary Sections whose ti- 
tles are designated, as being those of Invariant Sections, in the notice 
that says that the Document is released under this License. If a section 
does not fit the above definition of Secondary then it is not allowed to 
be designated as Invariant. The Document may contain zero Invari- 
ant Sections. If the Document does not identify any Invariant Sections 
then there arc none. 

The "Cover Texts" arc certain short passages of text that are listed, 
as Front-Cover Texts or Back-Cover Texts, in the notice that says that 
the Document is released under this License. A Pront-Cover Text may 
be at most 5 words, and a Back-Cover Text may be at most 25 words. 

A "Transparent" copy of the Document means a machine-readable 
copy, represented in a format whose specification is available to the 
general public, that is suitable for revising the document straightfor- 
wardly with generic text editors or (for images composed of pixels) 
generic paint programs or (for drawings) some widely available drawing 
editor, and that is suitable for input to text formatters or for automatic 
translation to a variety of formats suitable for input to text formatters. 
A copy made in an otherwise Transparent file format whose markup, 
or absence of markup, has been arranged to thwart or discourage sub- 
sequent modification by readers is not Transparent. An image format 
is not Transparent if used for any substantial amount of text. A copy 
that is not "Transparent" is called "Opaque" . 

Examples of suitable formats for Transparent copies include plain 
ASCII without markup, Texinfo input format, LaTeX input format, 
SGML or XML using a publicly available DTD, and standard-conforming 
simple HTML, PostScript or PDF designed for human modification. 
Examples of transparent image formats include PNG, XGF and JPG. 
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Opaque formats include proprietary formats that can be read and 
edited only by proprietary word processors, SGML or XML for which 
the DTD and/or processing tools arc not generally available, and the 
machine-generated HTML, PostScript or PDF produced by some word 
processors for output purposes only. 

The "Title Page" means, for a printed book, the title page itself, 
plus such following pages as are needed to hold, legibly, the material 
this License requires to appear in the title page. For works in formats 
which do not have any title page as such, "Title Page" means the text 
near the most prominent appearance of the work's title, preceding the 
beginning of the body of the text. 

A section "Entitled XYZ" means a named subunit of the Docu- 
ment whose title either is precisely XYZ or contains XYZ in paren- 
theses following text that translates XYZ in another language. (Here 
XYZ stands for a specific section name mentioned below, such as "Ac- 
knowledgements" , "Dedications", "Endorsements", or "History".) To 
"Preserve the Title" of such a section when you modify the Document 
means that it remains a section "Entitled XYZ" according to this def- 
inition. 

The Document may include Warranty Disclaimers next to the no- 
tice which states that this License applies to the Document. These 
Warranty Disclaimers are considered to be included by reference in 
this License, but only as regards disclaiming warranties: any other im- 
plication that these Warranty Disclaimers may have is void and has no 
effect on the meaning of this License. 



2. VERBATIM COPYING 

You may copy and distribute the Document in any medium, ei- 
ther commercially or noncommercially, provided that this License, the 
copyright notices, and the license notice saying this License applies to 
the Document are reproduced in all copies, and that you add no other 
conditions whatsoever to those of this License. You may not use tech- 
nical measures to obstruct or control the reading or further copying of 
the copies you make or distribute. However, you may accept compen- 
sation in exchange for copies. If you distribute a large enough number 
of copies you must also follow the conditions in section 3. 

You may also lend copies, under the same conditions stated above, 
and you may publicly display copies. 



142 



D. GNU FREE DOCUMENTATION LICENSE 



3. COPYING IN QUANTITY 

If you publish printed copies (or copies in media that commonly 
have printed covers) of the Document, numbering more than 100, and 
the Document's hcense notice requires Cover Texts, you must enclose 
the copies in covers that carry, clearly and legibly, all these Cover 
Texts: Front-Cover Texts on the front cover, and Back-Cover Texts on 
the back cover. Both covers must also clearly and legibly identify you as 
the publisher of these copies. The front cover must present the full title 
with all words of the title equally prominent and visible. You may add 
other material on the covers in addition. Copying with changes limited 
to the covers, as long as they preserve the title of the Document and 
satisfy these conditions, can be treated as verbatim copying in other 
respects. 

If the required texts for either cover are too voluminous to fit legibly, 
you should put the first ones listed (as many as fit reasonably) on the 
actual cover, and continue the rest onto adjacent pages. 

If you publish or distribute Opaque copies of the Document num- 
bering more than 100, you must either include a machine-readable 
Transparent copy along with each Opaque copy, or state in or with 
each Opaque copy a computer-network location from which the gen- 
eral network-using public has access to download using public-standard 
network protocols a complete Transparent copy of the Document, free 
of added material. If you use the latter option, you must take reason- 
ably prudent steps, when you begin distribution of Opaque copies in 
quantity, to ensure that this Transparent copy will remain thus accessi- 
ble at the stated location until at least one year after the last time you 
distribute an Opaque copy (directly or through your agents or retailers) 
of that edition to the public. 

It is requested, but not required, that you contact the authors of 
the Document well before redistributing any large number of copies, 
to give them a chance to provide you with an updated version of the 
Document. 



4. MODIFICATIONS 

You may copy and distribute a Modified Version of the Document 
under the conditions of sections 2 and 3 above, provided that you re- 
lease the Modified Version under precisely this License, with the Modi- 
fied Version filling the role of the Document, thus licensing distribution 
and modification of the Modified Version to whoever possesses a copy 
of it. In addition, you must do these things in the Modified Version: 
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A. Use in the Title Page (and on the covers, if any) a title distinct 
from that of the Document, and from those of previous ver- 
sions (which should, if there were any, be listed in the History 
section of the Document). You may use the same title as a 
previous version if the original publisher of that version gives 
permission. 

B. List on the Title Page, as authors, one or more persons or 
entities responsible for authorship of the modifications in the 
Modified Version, together with at least five of the principal 
authors of the Document (all of its principal authors, if it has 
fewer than five), unless they release you from this requirement. 

C. State on the Title page the name of the publisher of the Mod- 
ified Version, as the publisher. 

D. Preserve all the copyright notices of the Document. 

E. Add an appropriate copyright notice for your modifications 
adjacent to the other copyright notices. 

F. Include, immediately after the copyright notices, a license no- 
tice giving the public permission to use the Modified Version 
under the terms of this License, in the form shown in the Ad- 
dendum below. 

G. Preserve in that license notice the full lists of Invariant Sec- 
tions and required Cover Texts given in the Document's license 
notice. 

H. Include an unaltered copy of this License. 

I. Preserve the section Entitled "History", Preserve its Title, and 
add to it an item stating at least the title, year, new authors, 
and publisher of the Modified Version as given on the Title 
Page. If there is no section Entitled "History" in the Docu- 
ment, create one stating the title, year, authors, and publisher 
of the Document as given on its Title Page, then add an item 
describing the Modified Version as stated in the previous sen- 
tence. 

J. Preserve the network location, if any, given in the Document 
for public access to a Transparent copy of the Document, and 
likewise the network locations given in the Document for pre- 
vious versions it was based on. These may be placed in the 
" History" section. You may omit a network location for a work 
that was published at least four years before the Document it- 
self, or if the original publisher of the version it refers to gives 
permission. 

K. For any section Entitled "Acknowledgements" or "Dedications", 
Preserve the Title of the section, and preserve in the section 
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all the substance and tone of each of the contributor acknowl- 
edgements and/or dedications given therein. 

L. Preserve all the Invariant Sections of the Document, unaltered 
in their text and in their titles. Section numbers or the equiv- 
alent are not considered part of the section titles. 

M. Delete any section Entitled "Endorsements". Such a section 
may not be included in the Modified Version. 

N. Do not retitle any existing section to be Entitled "Endorse- 
ments" or to conflict in title with any Invariant Section. 

O. Preserve any Warranty Disclaimers. 

If the Modified Version includes new front-matter sections or appen- 
dices that qualify as Secondary Sections and contain no material copied 
from the Document, you may at your option designate some or all of 
these sections as invariant. To do this, add their titles to the list of 
Invariant Sections in the Modified Version's license notice. These titles 
must be distinct from any other section titles. 

You may add a section Entitled "Endorsements" , provided it con- 
tains nothing but endorsements of your Modified Version by various 
parties-for example, statements of peer review or that the text has 
been approved by an organization as the authoritative definition of a 
standard. 

You may add a passage of up to five words as a Front-Cover Text, 
and a passage of up to 25 words as a Back-Cover Text, to the end 
of the list of Cover Texts in the Modified Version. Only one passage 
of Pront-Cover Text and one of Back-Cover Text may be added by 
(or through arrangements made by) any one entity. If the Document 
already includes a cover text for the same cover, previously added by 
you or by arrangement made by the same entity you are acting on 
behalf of, you may not add another; but you may replace the old one, 
on explicit permission from the previous publisher that added the old 
one. 

The author (s) and publisher (s) of the Document do not by this 
License give permission to use their names for publicity for or to assert 
or imply endorsement of any Modified Version. 

5. COMBINING DOCUMENTS 

You may combine the Document with other documents released un- 
der this License, under the terms defined in section 4 above for modified 
versions, provided that you include in the combination all of the Invari- 
ant Sections of all of the original documents, unmodified, and list them 
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all as Invariant Sections of your combined work in its license notice, 
and that you preserve all their Warranty Disclaimers. 

The combined work need only contain one copy of this License, 
and multiple identical Invariant Sections may be replaced with a single 
copy. If there are multiple Invariant Sections with the same name but 
different contents, make the title of each such section unique by adding 
at the end of it, in parentheses, the name of the original author or 
publisher of that section if known, or else a unique number. Make the 
same adjustment to the section titles in the list of Invariant Sections 
in the license notice of the combined work. 

In the combination, you must combine any sections Entitled "His- 
tory" in the various original documents, forming one section Enti- 
tled "History"; likewise combine any sections Entitled "Acknowledge- 
ments", and any sections Entitled "Dedications". You must delete all 
sections Entitled "Endorsements." 

6. COLLECTIONS OF DOCUMENTS 

You may make a collection consisting of the Document and other 
documents released under this License, and replace the individual copies 
of this License in the various documents with a single copy that is 
included in the collection, provided that you follow the rules of this 
License for verbatim copying of each of the documents in all other 
respects. 

You may extract a single document from such a collection, and 
distribute it individually under this License, provided you insert a copy 
of this License into the extracted document, and follow this License in 
all other respects regarding verbatim copying of that document. 

7. AGGREGATION WITH INDEPENDENT WORKS 

A compilation of the Document or its derivatives with other sep- 
arate and independent documents or works, in or on a volume of a 
storage or distribution medium, is called an "aggregate" if the copy- 
right resulting from the compilation is not used to limit the legal rights 
of the compilation's users beyond what the individual works permit. 
When the Document is included an aggregate, this License does not 
apply to the other works in the aggregate which are not themselves 
derivative works of the Document. 

If the Cover Text requirement of section 3 is applicable to these 
copies of the Document, then if the Document is less than one half of the 
entire aggregate, the Document's Cover Texts may be placed on cov- 
ers that bracket the Document within the aggregate, or the electronic 
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equivalent of covers if tlie Document is in electronic form. Otherwise 
they must appear on printed covers that bracket the whole aggregate. 

8. TRANSLATION 

Translation is considered a kind of modification, so you may dis- 
tribute translations of the Document under the terms of section 4. Re- 
placing Invariant Sections with translations requires special permission 
from their copyright holders, but you may include translations of some 
or all Invariant Sections in addition to the original versions of these 
Invariant Sections. You may include a translation of this License, and 
all the license notices in the Document, and any Warrany Disclaimers, 
provided that you also include the original English version of this Li- 
cense and the original versions of those notices and disclaimers. In case 
of a disagreement between the translation and the original version of 
this License or a notice or disclaimer, the original version will prevail. 

If a section in the Document is Entitled "Acknowledgements" , "Ded- 
ications", or "History", the requirement (section 4) to Preserve its Title 
(section 1) will typically require changing the actual title. 

9. TERMINATION 

You may not copy, modify, sublicense, or distribute the Document 
except as expressly provided for under this License. Any other attempt 
to copy, modify, sublicense or distribute the Document is void, and 
will automatically terminate your rights under this License. However, 
parties who have received copies, or rights, from you under this License 
will not have their licenses terminated so long as such parties remain 
in full compliance. 

10. FUTURE REVISIONS OF THIS LICENSE 

The Free Software Foundation may publish new, revised versions of 
the GNU Free Documentation License from time to time. Such new ver- 
sions will be similar in spirit to the present version, but may differ in de- 
tail to address new problems or concerns. See http: / /www. gnu.org/copyleft/. 

Each version of the License is given a distinguishing version number. 
If the Document specifies that a particular numbered version of this 
License "or any later version" applies to it, you have the option of 
following the terms and conditions either of that specified version or 
of any later version that has been published (not as a draft) by the 
Free Software Foundation. If the Document does not specify a version 
number of this License, you may choose any version ever published (not 
as a draft) by the Free Software Foundation. 
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